Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
LINEヤフーの音声AIがもたらす未来:ASR/TTSと対話技術の新たな可能性 / LY Co...
Search
LINEヤフーTech (LY Corporation Tech)
PRO
July 01, 2025
Technology
0
54
LINEヤフーの音声AIがもたらす未来:ASR/TTSと対話技術の新たな可能性 / LY Corporation's Speech AI Vision: Towards Realtime Spoken Dialogue through Advanced ASR and TTS
LINEヤフーの音声認識と音声合成技術を活用した応用事例と、近年注目されているLLM基盤のリアルタイム音声対話技術の自社の取り組みについて紹介します。
LINEヤフーTech (LY Corporation Tech)
PRO
July 01, 2025
Tweet
Share
More Decks by LINEヤフーTech (LY Corporation Tech)
See All by LINEヤフーTech (LY Corporation Tech)
Yahoo!しごとカタログ 新しい境地を創るエンジニア募集!
lycorptech_jp
PRO
2
290
データグループにおけるフロントエンド開発
lycorptech_jp
PRO
2
240
Yahoo!知恵袋におけるフロントエンド開発
lycorptech_jp
PRO
0
240
"LINE Planet" and AI: Conversations with AI
lycorptech_jp
PRO
0
50
Seamless inventory management with AI
lycorptech_jp
PRO
0
24
AI Frontiers Revealed: Transforming LINE Shopping TW with LLM-Driven Product Attribute Extraction
lycorptech_jp
PRO
0
42
「Yahoo!検索」におけるWebパフォーマンス改善の取り組み / Efforts to Improve Web Performance in "Yahoo! JAPAN Search"
lycorptech_jp
PRO
1
62
アクセシビリティ改善の実践:プロダクトにおける具体的な取り組みと課題 / Practices for Accessibility Improvement: Specific Efforts and Challenges in Products
lycorptech_jp
PRO
0
57
「PayPayゲートウェイ」におけるStorybook活用事例 / Introducing Storybook: Enhancing Development in "PayPay Gateway"
lycorptech_jp
PRO
0
130
Other Decks in Technology
See All in Technology
Four Keysから始める信頼性の改善 - SRE NEXT 2025
ozakikota
0
430
ゼロから始めるSREの事業貢献 - 生成AI時代のSRE成長戦略と実践 / Starting SRE from Day One
shinyorke
PRO
0
160
ObsidianをLLM時代のナレッジベースに! クリッピング→Markdown→CLI連携の実践
srvhat09
6
3.7k
ソフトウェアQAがハードウェアの人になったの
mineo_matsuya
3
230
アクセスピークを制するオートスケール再設計: 障害を乗り越えKEDAで実現したリソース管理の最適化
myamashii
1
740
SRE with AI:実践から学ぶ、運用課題解決と未来への展望
yoshiiryo1
1
450
Digitization部 紹介資料
sansan33
PRO
1
4.5k
Frontier Airlines Customer®️ USA Contact Numbers: Complete 2025 Support Guide
frontierairlineswithflyagent
0
100
[SRE NEXT 2025] すみずみまで暖かく照らすあなたの太陽でありたい
carnappopper
2
580
Talk to Someone At Delta Airlines™️ USA Contact Numbers
travelcarecenter
0
160
AIでテストプロセス自動化に挑戦する
sakatakazunori
1
570
本当にわかりやすいAIエージェント入門
segavvy
7
4.1k
Featured
See All Featured
The Art of Programming - Codeland 2020
erikaheidi
54
13k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
130
19k
How STYLIGHT went responsive
nonsquared
100
5.6k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
234
17k
Code Reviewing Like a Champion
maltzj
524
40k
How to Ace a Technical Interview
jacobian
278
23k
Scaling GitHub
holman
460
140k
Fireside Chat
paigeccino
37
3.5k
Agile that works and the tools we love
rasmusluckow
329
21k
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
Building an army of robots
kneath
306
45k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
8
700
Transcript
-:$PSQPSBUJPOT4QFFDI"*7JTJPO 5PXBSET3FBMUJNF4QFFDIUP4QFFDI UISPVHI"EWBODFE"43BOE554 4QFFDIBOE"DPVTUJD"*%FQU %BUB4DJFODF(SPVQ +VNQFJ .JZBLF 5BJLJ,JOPTIJUB -*/&ϠϑʔͷԻ"*͕ͨΒ͢ະདྷɿ"43554ͱରٕज़ͷ৽ͨͳՄೳੑ
"HFOEB -:$PSQPSBUJPO`T4QFFDI"* -*/&ϠϑʔͷԻ"*ʹ͍ͭͯ -:$PSQPSBUJPO`T"43554 -*/&ϠϑʔͷԻೝࣝɾԻ߹ͷհ 3FBMUJNF4QFFDIUP4QFFDI ϦΞϧλΠϜ4QFFDIUP4QFFDIٕज़։ൃͷऔΓΈʹ͍ͭͯ
'VUVSF8PSLT ࠓޙͷల
-:$PSQPSBUJPO`T4QFFDI"* -*/&ϠϑʔͷԻ"*ʹ͍ͭͯ 7JEFPBOE"VEJP $POUFOU"OBMZTJT 4QFFDI 3FDPHOJUJPO 4QFFDI (FOFSBUJPO 7JEFP"VEJP$POUFOUT $BMM$FOUFS
.FFUJOH 7PJDF6TFS*OUFSGBDF 7JEFP"VEJP$POUFOUBOE$BMM"OBMZTJT ࣸਅૉࡐఏڙΞϑϩ
-*/&ϠϑʔͷԻೝࣝͱԻ߹ͷհ *OUSPEVDUJPOPG -:$PSQPSBUJPO`T"43554 "43"VUPNBUJD4QFFDI3FDPHOJUJPO 5545FYU5P4QFFDI
:+70*$&4USFBNJOH"43 :+70*$&ετϦʔϛϯάԻೝࣝ &GGJDJFOUMZBEBQUTUPUBSHFUEPNBJOT • "TUSBUFHZCBTFEPODPNQBDUNPEFMTXJUIPVU FYUFSOBMMBOHVBHFNPEFMT • %PNBJOBEBQUBUJPOXJUIPVUUBSHFUBVEJPEBUB 1BJSFETQFFDIUFYUEBUB 6OQBJSFEUFYUEBUB
#BTF.PEFM "EBQUBUJPO .PEFM 4QFFDI 5FYU 5FYU #PPTUTQISBTFXJUIVTFSEJDUJPOBSJFT 4QFFDI 3FDPHOJUJPO 8PVMEZPV MJLFUPTUBSU UIFOBWJHBUJPO WJBUIJTSPVUF 4FSWJDF4QFDJGJD %JDUJPOBSZ :FT /P 1SJPSJUJ[FFYQSFTTXBZT 1SJPSJUJ[FHFOFSBMSPBET ʷ :FBTU ˠ ˓ :FT ʷ ,OPX ˠ ˓ /P ʷ 1SJPSJUJ[FHFOFSBMMPBET ˣ ˓ 1SJPSJUJ[FHFOFSBM SPBET 3FTPMWFTIPNPOZNT ຊ ڮ χ ϗ ϯ ό γ · Ͱ Ϛ σ ͷ ϊʜ 4VSGBDF 3FBE 4VSGBDF 3FBE &OEUP&OE "43 4QFFDI • JF ɾຊڮ χϗϯόγ JTBMPDBUJPOJO5PLZP ɾຊڮ χοϙϯόγ JTBMPDBUJPOJO0TBLB • +PJOUQSFEJDUJPOPGCPUITVSGBDFBOESFBEJOH ಉදهҟԻޠ ޮతͳυϝΠϯదԠ ಈతϢʔβࣙॻʹΑΔϑϨʔζೝࣝڧԽ ˞"CPVU'FBUVSF 'FBUVSF)JHIBDDVSBDZGPSXFCTFBSDIBOE-:$PSQPSBUJPOEPNBJO 'FBUVSF 3FTPMWFTIPNPOZNTBOEDVTUPNJ[FTFBTJMZ 'FBUVSF1SPWJEFT8FC"1*BOEPOEFWJDFNPEVMFT
"DIPSJT&YQSFTTJWF554 "DIPSJT දݱྗ͕๛͔ͳԻ߹ 'FBUVSF$POUSPMFNPUJPOJOUFOTJUZXJUIFYQSFTTJPOTUZMFT 'FBUVSF QSFTFUTQFBLFSPQUJPOTXJUIIVNBOMJLFRVBMJUZ 'FBUVSF1SPWJEFT8FC"1* POEFWJDFNPEVMFTBOEFEJUJOHXFCUPPMT "DIPSJT &EJUPS5FYUUPTQFFDIFEJUJOHUPPM
"DIPSJT &YQSFTTJWFUFYUUPTQFFDI $POUSPM0WFS4QFBLFS &NPUJPO BOE*OUFOTJUZ
ԻೝࣝͷαʔϏε׆༻ࣄྫ • :BIPP +"1"/"QQ`T7PJDF4FBSDI • 7PJDF4FBSDIJTJNQMFNFOUFEJONPTU:BIPP+"1"/4FSWJDFT JFTFSWJDFTJODMVEJOH.BQT 5SBOTJU BOETIPQQJOH :BIPP+"1"/"QQ
J04"OESPJE &YBNQMFTPG"QQMJDBUJPO
Ի߹ͷαʔϏε׆༻ࣄྫ • /BWJHBUJPOWPJDFJO:BIPP+"1"/$BS/BWJHBUJPO"QQ • 0O%FWJDF/FVSBM5FYU5P4QFFDI&OHJOF DBMMFEl"DIPSJT -JUFz • (FOFSBUFBTFDPOEBVEJPXBWFGPSNJO
TFDPOET ˎ 3FBM5JNF'BDUPS 35' JTJOJ1IPOF • 5PNJOJNJ[FBQQTJ[F XFWFJNQMFNFOUFE WBSJPVTPQUJNJ[BUJPOTJOCPUIJOGFSFODFMJCSBSJFTBOENPEFMTJ[F :BIPP+"1"/$BS/BWJHBUJPO"QQ &YBNQMFTPG"QQMJDBUJPO
"MBCBQQGPSSFBMUJNFTQPLFOEJBMPHVF CBTFEPOB--. VOEFSEFWFMPQJOH IUUQTXXXMZDPSQDPKQKBUFDIOPMPHZEFTJHOMBCT &YBNQMFTPG"QQMJDBUJPO ϦΞϧλΠϜԻରͷ࣮ݧΞϓϦ ։ൃத
ϦΞϧλΠϜ4QFFDIUP4QFFDIٕज़։ൃͷऔΓΈ 3FBMUJNF4QFFDIUP4QFFDI
3FBMUJNF4QFFDIUP4QFFDI5SFOE ϦΞϧλΠϜ4QFFDIUP4QFFDIͷٕज़ಈ IUUQTPQFOBJDPNKB+1DIBUHQUPWFSWJFX IUUQTHFNJOJHPPHMFPWFSWJFXHFNJOJMJWF IUUQTNPTIJDIBU IUUQTOVEJBMPHVFHJUIVCJPKNPTIJ
0QFO"* $IBU(15 "EWBODFE7PJDF.PEF ,ZVUBJ .PTIJ /BHPZBVOJW +.PTIJ (PPHMF (FNJOJ-JWF
4QFFDIUP4QFFDI"SDIJUFDUVSF 4QFFDIUP4QFFDIͷϞσϧߏ -BSHF-BOHVBHF.PEFM 5FYU(VJEFE4QFFDI (FOFSBUJPO Low-latency speech generation using audio
tokens or a streaming TTS module "VEJP"EBQUFS 4QFFDI &ODPEFS .PEBMJUZBMJHONFOU CFUXFFOUFYUBOEBVEJP 1SPNQU 4QFFDI 4JOHMF4USFBN 6TFSTTQFFDIPOMZ .VMUJ4USFBN 6TFSTTQFFDI --.HFOFSBUFETQFFDI
1SPTPG*OUFHSBUJOH4QFFDI&ODPEFS XJUI--.T Իͱ--.Tͷ౷߹ʹΑΔར -FWFSBHF--. $BQBCJMJUJFT 1SPNQU%SJWFO 'MFYJCJMJUZ #ZQBTT "43&SSPST --.ͷߴͳج൫ೳྗͷ׆༻
ϓϩϯϓτʹΑΔߴ͍ΧελϚΠζੑ ԻೝࣝޡΓͷӨڹΛճආ
&WBMVBUJPOPG5BTL1FSGPSNBODF 4QFFDI--.ͷλεΫੑೳͷධՁ JOQVU +42V"% 2VFTUJPO"OTXFS DIBS@G "-5 5SBOTMBUJPOGSPNKQ UPFO
#FSU4DPSF (SPVOEUSVUIUFYU UFYUUPUFYU 5SBOTDSJCFEUFYU UFYUUPUFYU 4QFFDI TQFFDIUPUFYU --.HFNNBCJU "43NPEFMXIJTQFSTNBMM 4QFFDI&ODPEFSXIJTQFSTNBMM 5SBJOJOH5PPMLJU4-".--. &WBMVBUJPO5PPMLJUMMNKQFWBM 4-".--.IUUQTHJUIVCDPN9-"/$&4-".--. MMNKQFWBMIUUQTHJUIVCDPNMMNKQMMNKQFWBM $PNQBSBCMF QFSGPSNBODFPO USBOTMBUJPO UBTL #FUUFS QFSGPSNBODFPO 2"UBTL
&WBMVBUJPOPG*OGFSFODF4QFFE ਪͷධՁ 0 10 20 30 40 50
60 vllm slam-llm (transformers) Generated Characters per Second W--.,XPO 8PPTVL FUBM&GGJDJFOUNFNPSZNBOBHFNFOUGPSMBSHFMBOHVBHFNPEFMTFSWJOHXJUIQBHFEBUUFOUJPO1SPDFFEJOHTPGUIFUI4ZNQPTJVNPO0QFSBUJOH4ZTUFNT1SJODJQMFT 'BTUFS YGBTUFS (FOFSBUFT)PXDBO*IFMQZPVUPEBZ JOTFDPOET /VNCFSPG5PLFOT
'VUVSF8PSLT ࠓޙͷల 8FBSFEFWFMPQJOH • 3FBMUJNF4QFFDIUP4QFFDI JOUFHSBUJPOXJUI--. • .VMUJMJOHVBM4QFFDI5P5FYU5FYU5P4QFFDI 7PJDF$POUSPMJOB$BS )VNBOMJLFBOE/BUVSBM
$POWFSTBUJPOBM4FBSDI 4QPLFO%JBMPHVFWJB$BMM "*"HFOU 03 4FBSDI 8FBUIFS 1PEDBTU "*"HFOU "*"HFOU "*"HFOU
EOP