Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Tokyo BISH Bash #02 音声情報処理と音声変換技術入門
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Akira Tamamori
June 30, 2020
Research
2.3k
2
Share
Tokyo BISH Bash #02 音声情報処理と音声変換技術入門
Akira Tamamori
June 30, 2020
More Decks by Akira Tamamori
See All by Akira Tamamori
音声認識と音声合成の超入門
tam17aki
0
520
音声情報処理に便利な (Python) パッケージやソフトウェア
tam17aki
3
960
[ICASSP2020音響音声読み会] State-Space Gaussian Process for Drift Estimation in Stochastic Differential Equations
tam17aki
0
590
Other Decks in Research
See All in Research
Can We Teach Logical Reasoning to LLMs? – An Approach Using Synthetic Corpora (AAAI 2026 bridge keynote)
morishtr
1
250
YOLO26_ Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
satai
3
770
SoftMatcha 2: 1兆語規模コーパスの超高速かつ柔らかい検索
e869120_sub
6
3.4k
Using our influence and power for patient safety
helenbevan
0
360
Cross-Media Information Spaces and Architectures
signer
PRO
0
290
CyberAgent AI Lab研修 / Social Implementation Anti-Patterns in AI Lab
chck
7
4.6k
Dual Quadric表現を用いた動的物体追跡とRGB-D・IMU制約の密結合によるオドメトリ推定
nanoshimarobot
0
400
第12回人と環境にやさしい交通をめざす全国大会/熊本都市圏「車1割削減、渋滞半減、公共交通2倍」をめざして
trafficbrain
0
110
言語モデルから言語について語る際に押さえておきたいこと
eumesy
PRO
5
2.3k
LLM の Attention 機構まとめ — 数式・計算量・メモリ
puwaer
7
2k
論文紹介 "ReSim: Reliable World Simulation for Autonomous Driving"
kogo
0
610
Harness Engineering and Al Agent
kzinmr
3
1.6k
Featured
See All Featured
What does AI have to do with Human Rights?
axbom
PRO
1
2.2k
A better future with KSS
kneath
240
18k
Git: the NoSQL Database
bkeepers
PRO
432
67k
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
37
6.5k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
38
2.9k
The B2B funnel & how to create a winning content strategy
katarinadahlin
PRO
1
380
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
590
The Director’s Chair: Orchestrating AI for Truly Effective Learning
tmiket
1
180
Primal Persuasion: How to Engage the Brain for Learning That Lasts
tmiket
0
360
Fireside Chat
paigeccino
42
3.9k
The Language of Interfaces
destraynor
162
27k
GraphQLとの向き合い方2022年版
quramy
50
15k
Transcript
Իมٕज़ೖ ʙԻใॲཧೖΛఴ͑ͯʙ Tokyo BISH Bash #02 2020/06/30
࣍ • Իใॲཧೖʢʙ5ʣ • Իมٕज़ೖʢ20ʣ • ศརͳύοέʔδɾιϑτΣΞͷհʢΓ࣌ؒʣ ߴߍଔۀఔͷʮಈʯͷࣝલఏͱ͠·͢; ৼ෯ɺपͳͲ
ຊൃදͰԻΛྲྀ͠·͢ͷͰɺ͓खݩͷPCͷԻྔʹ͝ҙΛʂ 3
Իใॲཧೖ ԻͷʮใʯͬͯԿͧʁ
ԻͱԿ͔ʁ • ۭؒΛΘΔʮʯ • ۭؾࢠͷૈີ͕ΘΔʢഏ → → ޱ→ۭؒʣ •
ίϛϡχέʔγϣϯͷʮϝσΟΞʯ • ݴޠײɺऀੑͳͲΛ͑Δഔମ • จԽɾܳज़ʹ͓͚ΔʮՃՁʯ • ਓͷ৺Λಈ͔͢Վএ 5 Tokyo BISH Bash ! Tokyo BISH Bash ̇ Tokyo BISH Bash ͕ྑ͍Μͩͬͯ͞
Իʹؚ·ΕΔ̏ͭͷใ 6 ݴޠใ ύϥݴޠใ ඇݴޠใ ςΩετԽ͕Մೳ • ςΩετԽ͕ෆՄೳ • ऀ͕ҙਤతʹੜ
• ײଶͳͲ • ςΩετԽ͕ෆՄೳ • ऀ͕ҙਤ͠ͳ͍ • ྸੑผɺऀੑ Ի تͼͷ ײ ʮ"͞Μʯ ͱ͍͏ใ "͞Μ
ԻใॲཧͱԿ͔ʁ Իʹؚ·ΕΔʮ̏ͭͷใʯΛՃ/ಉఆ/ॲཧ͢Δٕज़ • Ի߹ɺԻೝࣝɺऀೝࣝɺײೝࣝɺԻมɺͳͲ ⇒ ຊߨԋʮԻมʯʢಛʹऀมʣʹϑΥʔΧε • ԻมɿݴޠใΛอ࣋ͭͭ͠ऀͷಛੑΛมߋ͢Δٕज़ 7 ऀͷมߋ
͠ํɾײͷมߋ ຊߨԋͷ ओ ͬͪ͜ˠ ඇݴޠใͷΈΛมߋ ύϥݴޠใͷΈΛมߋ
ԻใॲཧͱԿ͔ʁ Իʹؚ·ΕΔʮ̏ͭͷใʯΛՃ/ಉఆ/ॲཧ͢Δٕज़ • Ի߹ɺԻೝࣝɺऀೝࣝɺײೝࣝɺԻมɺͳͲ ⇒ ຊߨԋʮԻมʯʢಛʹऀมʣʹϑΥʔΧε • ԻมɿݴޠใΛอ࣋ͭͭ͠ऀͷಛੑΛมߋ͢Δٕज़ 8 มख๏໊
ݴޠใ ύϥݴޠ ใ ඇݴޠ ใ Իӆม ྫ Bˠ J มߋ ݻఆ ݻఆ ײม ྫ ت ˠ ౖ ݻఆ มߋ ݻఆ ऀม ྫ ঁੑ"ˠ உੑ# ݻఆ ݻఆ มߋ
Իมٕज़ͷجૅ ಛʹऀʢ࣭ʣมʹϑΥʔΧε
͡Ίʹ • όʔνϟϧYouTuberʢVTuberʣͷོ 10 %%ΩϟϥΫλʔ όʔνϟϧYouTuber
͡Ίʹ • உੑʢঁੑʣ ͕ঁੑʢஉੑʣΩϟϥΫλʔʹ ⇒ ϘΠενΣϯδϟʔͷར༻ 11 ϘΠενΣϯδϟʔ ϘΠενΣϯδϟʔͷΈ JTԿʁ
Իͷੜաఔʢͬ͘͟Γͱʣ • ָ࣮ثͱྨࣅ 12 ઉޱͷܗΛมܗ ଳͷ։ดʹΑΔ ۭؾͷৼಈ ϛοΫεʂ Ի৭ Իߴ
ʹͳͬͯग़ͯ͘Δ ์ࣹ ʮ͜Μʹͪʯ https://ahcweb01.naist.jp/lecture/2018/sp/ material/20181108_slide.pdf ࢀߟ ͜Μʹͪ
Իͷੜաఔʢपੳͷ؍͔Βʣ • पಛੑදݱ 13 ύϫʔ प Ի৭ͷ༩ʢڞ໐ʣ ͳΊΒ͔ ύϫʔ प
ԻߴͷੜʢԻݯʣ ΪβΪβ มௐ ύϫʔ प पಛੑ ˞ͳΊΒ͔ɿεϖΫτϧแབྷ ˞ΪβΪβɿඍࡉߏ ˞มௐɿΈࠐΈૢ࡞
σϞϯετϨʔγϣϯ XJUI1ZUIPO • ܗͷϦΞϧλΠϜϓϩοτʮ͍͋͏͓͑ʯ 14 ֤ԻͰҟͳΔܗ
σϞϯετϨʔγϣϯ XJUI1ZUIPO • FFTεϖΫτϧͷϦΞϧλΠϜϓϩοτʮ͍͋͏͓͑ʯ 15 εϖΫτϧแབྷʢͳΊΒ͔ʣ ʴඍࡉߏʢΪβΪβʣ
σϞϯετϨʔγϣϯ XJUI1ZUIPO • εϖΫτϧแབྷͷϦΞϧλΠϜϓϩοτʮ͍͋͏͓͑ʯ 16 εϖΫτϧแབྷʢͳΊΒ͔ʣ
ԻੜաఔͷϞσϧԽʢʣ • Իݯ৴߸ΛΠϯύϧεྻͱന৭ࡶԻͰදݱ • Πϯύϧεྻ͕༗ԻʢଳৼಈΛ͏ʣʹରԠ • ന৭ࡶԻ͕ແԻʢଳৼಈΛΘͳ͍ʣʹରԠ 17 https://ahcweb01.naist.jp/lecture/2018/sp/material/20181108_slide.pdf Πϯύϧεྻ
ˠ༗ԻʹରԠ ന৭ࡶԻ ˠແԻʹରԠ Իྔҙʂ
ԻੜաఔͷϞσϧԽʢʣ • εϖΫτϧแབྷԻڹͷ࿈݁Ͱදݱ • ͷͲͷܗͱͷ௨ΓಓʢಓʣΛදݱ • ͷͲͷܗͷݸਓࠩ = εϖΫτϧแབྷͷݸਓࠩ ⇒ʮ৭ʯ
18 ʮʯͷ࿈݁ ଳଆ ޱ৶ଆ प ύϫʔ εϖΫτϧแབྷ
ԻੜաఔͷϞσϧԽʢʣ • Իݯ৴߸ΛԻڹʹ௨͢⇒ ԻੜʢϑΟϧλϦϯάʂʣ 19 ༗Ի ແԻ Իݯ Իݯ৴߸ εϖΫτϧแབྷ
Իੜ ˎ ʹ ԻڹʢεϖΫτϧแབྷʣ ߹ܗ ԻݯHTS slide ΑΓ http://hts.sp.nitech.ac.jp/?Download ݩͷԻ
Իಛྔͷऔಘʢʣ • Իͷੜաఔʹج͍ͮͯಛྔΛநग़͠࠶߹ ⇒ ϘίʔμʔʢVocoderʣͷద༻ 20 ৭ ʢεϖΫτϧแབྷʣ ͷߴ͞ ʢجຊपʣ
͔͢Ε۩߹ ʢඇपظੑࢦඪʣ Իੳ Իੜ ʮ͜Μʹͪʯ ʮ͜Μʹͪʯ
Իಛྔͷऔಘʢʣ • ԻΛੳ ⇒ ಛྔͷ࣌ܥྻΛऔಘ 21 ൣғΛগͣͭͣ͠Βͯ͠ੳ ˞͜ΕεϖΫτϧแབྷ
Իಛྔͷऔಘʢʣ • ԻΛੳ ⇒ ಛྔͷ࣌ܥྻΛऔಘ 22 ˞͜Εجຊप ൣғΛগͣͭͣ͠Βͯ͠ੳ
Իੳ͔ΒԻੜ·ͰͷྲྀΕ • 23 Իੳ εϖΫτϧ แབྷ جຊप ඇपظੑ ࢦඪ Իੜ
ᶃԻΛੳͯ͠ ಛྔϕΫτϧͷ ܥྻΛखʹೖΕΔ ʜ ʜ ੳ୯Ґ ʹϑϨʔϜ ᶅੜ͞ΕͨܗΛ ॏͶ߹ΘͤΔ ᶄ֤ϑϨʔϜͷಛྔΛ Ϙίʔμʔʹ௨͢ ᶆݩͷԻܗ͕ ࠶ߏ͞ΕΔ ʜ ੳ ੜ
ԻมʢϘΠενΣϯδϟʔʣͷߟ͑ํ 24 Իੳ εϖΫτϧ แབྷ جຊप ඇपظੑࢦඪ Իੜ • ৴߸ॲཧͰม
• ػցֶशͰม Իಛྔͷม Իม ಛྔؒͷࣸ૾
ύϥϝʔλ੍ޚʹΑΔԻՃʢʣ • ղͨ͠ԻಛྔͷՃ 25 ৭ ʢεϖΫτϧแབྷʣ ͷߴ͞ ʢجຊपʣ ͔͢Ε۩߹ ʢඇपظੑࢦඪʣ
Իੳ Իಛྔʢύϥϝʔλʣ Իੜ Ͳ͔͜ͷ#͞Μ ʮ͜Μʹͪʯ มɾՃॲཧ "͞Μ ʮ͜Μʹͪʯ
ύϥϝʔλ੍ޚʹΑΔԻՃʢʣ • جຊपͷมߋ 26 ࣌ؒ ଳԻݯʢύϧεܥྻʣ ִؒΛڱΊΔ ִؒΛ͛Δ ݩͷ ߴ͍
͍ ߴ͘͘ ͳΔ͕ ෆࣗવ͕͞Δ
ύϥϝʔλ੍ޚʹΑΔԻՃʢʣ • ϑΥϧϚϯτγϑτ ⇒ ಓͷ৳ॖ ʹରԠ 27 प ύϫʔ ࠨʹγϑτ
ӈʹγϑτ ͬͨ͜ʁ ཧతͳղऍɿ • ಓ͕͍ 㱺 ࢠڙঁੑ • ಓ͕͍ 㱺 େਓஉੑ Ӊਓʁ ಓͷॖ ಓͷ৳ ݩͷ
ύϥϝʔλ੍ޚʹΑΔԻՃʢʣ • पˍϑΥϧϚϯτͷมߋ ⇒ ΑΓࣗવͳΛ߹ 28 ݩͷ ࠨγϑτ ˍ ڱִ͍ؒ
ӈγϑτ ˍ ִ͍ؒ ࢠڙঁੑͷ ଠ͍
σϞϯετϨʔγϣϯ ݚڀࣨͷֶੜ͞Μ͕࡞ͨ͠ΞϓϦ • εϥΠμʔͰύϥϝʔλ੍ޚ • ҎԼΛϕʔεͱͯ͠࡞ • $ ൛ 803-%
˞̍ • +6$&˞̎ 29 ˞̍ɿߴ࣭Ϙίʔμʔͷ̍ͭ ˞̎ɿΦʔσΟΦܥΞϓϦͷ ։ൃʹదͨ͠$ ϑϨʔϜϫʔΫ εϥΠμʔͰ ੍ޚ εϖΫτϧ දࣔ Ӷҙ։ൃதͰ͢ʂ
Իมٕज़ͷൃల ͍͔ͭ͘ͷൃలͷʮํੑʯΛڍ͛ΔʹཹΊ·͢ ཏతͰͳ͘ɺશͯΛհ͖͠Ε·ͤΜ͕ɺ͝༰͍ࣻͩ͘͞
౷ܭత࣭ม • ౷ܭϞσϧΛར༻ͯ͠ԻಛྔΛม ⇒ ಛྔͷରԠؔʢࣸ૾ʣΛେྔͷσʔλͰֶश͓ͯ͘͠ඞཁ 31 Իੳ Իੜ #͞Μʮ͜Μʹͪʯ ౷ܭϞσϧ
ʹΑΔม "͞Μʮ͜Μʹͪʯ εϖΫτϧแབྷ جຊप ඇपظੑࢦඪ • Ψεࠞ߹Ϟσϧ • χϡʔϥϧωοτ ൃ༰Λอͪͭͭ ऀͷ࣭Λม https://www.slideshare.net/KentaroTachibana1/ss-94259438 ࢀߟ ϘίʔμʔʹΑΔ Իੜ
ൃలͷํੑʢʣ • ͦͷ̍ɿϘίʔμʔͷԻ࣭Λྑͯ͘͠ੑೳ্ ⇒ χϡʔϥϧϘίʔμʔͷొʢ2017Ҏ߱ʣ ࢀߟʰԻܗੜϞσϧʮχϡʔϥϧϘίʔμʯͷൺֱʱ(2019.10) ͳͲ • ͦͷ̎ɿϘίʔμʔΛΘͣʹ࣭มʂ ࠩεϖΫτϧิਖ਼ʹج࣭ͮ͘ม
⇒ ϘίʔμʔʹΑΔԻ࣭ͷྼԽΛܰݮՄೳɺϦΞϧλΠϜࢤ 32 https://www.slideshare.net/Takuma_OKAMOTO/ss-180895505 • ཁɿߴ࣭ɺߴͳ߹ɺֶश͕༰қɺলϝϞϦԽɺɺɺ ⇒ ԻมγεςϜʹΈࠐΜͰੑೳ্Λࢦ͢
ൃలͷํੑʢʣ • ͦͷ̎ɿԻͷՃऩʹ͔͔ΔίετΛԼ͛Δʂ • ͦͷ̏ɿ࣭͚ͩͰͳ͘ײมʂ • ͦͷ̐ɿ͚ͩ͠Ͱͳ͘Վͷ࣭มʂ • ͦͷ̑ɿϦΞϧλΠϜ͔ͭߴ࣭ͳ࣭มʂ ʮχϡʔϥϧωοτશʯͷ࣌
• χϡʔϥϧωοτͷೖྗଆͱग़ྗଆʹԿΛ࣋ͬͯ͘Δ͔͕ΧΪ • σʔλ͑͞༻ҙͰ͖Εࣸ૾ֶ͕शͰ͖Δ͔ʢͰ͖Δͱݴͬͯͳ͍ʣ • ৴߸ॲཧʹجͮ͘ख๏͜Ε͔Β݈ࡏʢͷͣʣ 33
ԻใॲཧԻมʹ ศརͳ (Python) ύοέʔδ ιϑτΣΞͨͪ ༻ײͳͲࢲݟΛؚΈ·͢
TPYQZTPY • ίϚϯυϥΠϯ͔ΒϑΥʔϚοτมͳͲΛ͓खܰʹ • ϑΥʔϚοτมʢwav to mp3 ͳͲʣ • ݁߹ϛοΫεɺτϦϛϯά
Մೳ • όονॲཧָʢγΣϧεΫϦϓτͳͲʣ • Pythonϥούʔ pysox͋Δ • Πϯετʔϧ • brew install sox ͳͲ • pip install sox ← pysox ͷΠϯετʔϧ͜Ε 35
MJCSPTBʢ͜ΕຊʹΦεεϝʣ • Ի/ԻָͷੳʹศརͳϞδϡʔϧ͕ଗͬͨύοέʔδ • ެࣜϚχϡΞϧɾνϡʔτϦΞϧͷॆ࣮ॿ͔Δ • ݸਓతʹΑ͘͏ػೳ • ܗදࣔɺεϖΫτϩάϥϜදࣔ •
Իಛྔநग़ʢରϝϧεϖΫτϩάϥϜʣ • Πϯετʔϧ pip install librosa • ެࣜϖʔδ https://librosa.org/librosa/index.html 36
1Z8PSME • Իͷੳ࠶߹Λߦ͏Ϙίʔμʔͷύοέʔδ • ԻΛʮ৭ɾͷߴ͞ɾͷ͔͢Εʯͷ֤ʹղ͠࠶߹ • C++൛ͷPythonϥούʔ • Իͷಛྔநग़ʹ͑ͯศར ⇒
PySPTKʢޙड़ʣΑΓ࣭ͷΑ͍εϖΫτϧแབྷ • Πϯετʔϧ pip install pyworld • ެࣜϖʔδ https://github.com/JeremyCCHsu/Python-Wrapper-for-World-Vocoder 37
1Z"VEJP • ετϦʔϜԻ / ࠶ੜʹศརͳύοέʔδ • ϦΞϧλΠϜͷԻೖྗɾԻग़ྗʹ͑Δ • ϦΞϧλΠϜԻม with
PythonͳͲՄೳ • Πϯετʔϧ • pip install pyaudio ཁportaudio (e.g., brew install portaudio) 38
1Z"VEJPͱ1Z8PSMEͷΈ߹Θͤ • ؆қ൛ͷϘΠενΣϯδϟʔ • ؆қϘΠενΣϯδϟʔͷεΫϦϓτΛվྑɿPyQt5ͷεϥΠμʔʹΑΓ ϐονͱϑΥϧϚϯτΛϦΞϧλΠϜௐ͢ΔػೳΛՃʢฐϒϩάʣ • banibiku • Zoom৴͚ʹ̎࣍ݩΩϟϥʹͳΓ͖Δ͜ͱΛࢦͨ͠ϓϩδΣΫτ
• scripts/voice_converter.py ͕ྑ͍ײ͡ͷϘΠενΣϯδϟʔ → ฐϒϩάͷαϯϓϧεΫϦϓτͷόάϑΟοΫεؚ͕·ΕΔ 39 https://tam5917.hatenablog.com/entry/2019/04/30/213321 https://github.com/peisuke/babiniku
1Z"VEJPͱ1Z2UͷΈ߹Θͤ • ܗϞχλϦϯάʢʮ͍͋͏͓͑ʯʣ • ࢀߟ ϦΞϧλΠϜʹมԽ͢ΔԻͷܗΛදࣔ͠ଓ͚ΔPythonεΫϦϓτʢฐϒϩάʣ 40 https://tam5917.hatenablog.com/entry/2019/04/28/130641
1Z"VEJPͱ1Z2UͷΈ߹Θͤ • FFTεϖΫτϧϞχλϦϯάʢʮ͍͋͏͓͑ʯʣ • ࢀߟ ϦΞϧλΠϜʹมԽ͢ΔԻͷFFTεϖΫτϧΛදࣔ͢ΔPythonεΫϦϓτʢฐϒϩάʣ 41 https://tam5917.hatenablog.com/entry/2019/04/28/125857
1Z"VEJPͱ1Z2UͷΈ߹Θͤ • εϖΫτϧแབྷϞχλϦϯά ʢʮ͍͋͏͓͑ʯʣ ࢀߟ ϦΞϧλΠϜʹมԽ͢ΔԻͷεϖΫτϧแབྷΛදࣔ͢ΔPythonεΫϦϓτʢฐϒϩάʣ 42 https://tam5917.hatenablog.com/entry/2019/04/28/130641
1Z415, • ԻใॲཧπʔϧΩοτSPTKͷPythonϥούʔ • SPTKࣗମLinuxίϚϯυ܈ • Իڹಛྔநग़ʹ͏ͷ͕ศར • Իੳ߹Ͱ͖Δ͕ɺ࣭ࣗମWORLDͷ΄͏্͕ •
Πϯετʔϧ pip install pysptk • ެࣜϖʔδ https://pysptk.readthedocs.io/en/latest/ 43
OONOLXJJ <OBOBNJO LBXBJJ> • DNNԻ߹ʹཱͭϞδϡʔϧΛूΊͨύοέʔδ • ͲͪΒ͔ͱ͍͏ͱݚڀ༻్ • લॲཧԻڹಛྔநग़ͷΫϥε͕Ұ௨Γଗ͍ͬͯΔ •
จͷ࠶ݱ࣮Λ͢Δͱ͖ͳͲʹେ͍ʹཱͭ • Πϯετʔϧ pip install nnmnkwii • ެࣜϖʔδ https://r9y9.github.io/nnmnkwii/stable/index.html 44
1ZEVC • Pydub • ܗฤूʹศརͳϞδϡʔϧΛूΊͨύοέʔδ • αϙʔτ͢ΔϑΝΠϧܗࣜ๛ʢwav, mp3, mp4, wma,
aac, ...ʣ • ػೳ Γग़͠ɺׂɺϛοΫεɺϑΣʔυΠϯΞτɺແԻૠೖɺͳͲͳͲ • Ұ෦ͷػೳ pysoxͷ΄͏͕ߴͱ͍͏ӟ?ʢະ֬ೝʣ • Πϯετʔϧ pip install pydub • ެࣜϖʔδ http://pydub.com/ 45
TQSPDLFU • ౷ܭత࣭มͷͨΊͷπʔϧΩοτ (not ύοέʔδ) • ͲͪΒ͔ͱ͍͏ͱݚڀ༻ ʢMITϥΠηϯεʣ • ݚڀͷʮϕʔεϥΠϯʯߏஙʹ࠷ద
• ެࣜϖʔδ https://github.com/k2kobayashi/sprocket • ղઆจ ʰ౷ܭత࣭มιϑτΣΞೖʱ https://www.jstage.jst.go.jp/article/isciesci/62/2/62_69/_article/-char/ja/ • νϡʔτϦΞϧ (εϥΠυ & notebook) https://github.com/kan-bayashi/INTERSPEECH19_TUTORIAL 46
"VEBDJUZ ೖΕ͓ͯ͘ͱ҆৺ • ϑϦʔͷܗฤूιϑτɺϚϧνϓϥοτϑΥʔϜ • ๛ͳαϯυΤϑΣΫτՃػೳ • ެࣜϖʔδ https://www.audacityteam.org/ 47
ͦͷଞ • Voidol • ϦΞϧλΠϜ࣭มʢ༗ྉʣ • ࣗͷΛλʔήοτͷΩϟϥʹม • Realtime Yukarin
• ϦΞϧλΠϜ࣭ม • Gachikoe! Core • ϦΞϧλΠϜ࣭ม 48 https://crimsontech.jp/apps/voidol/ https://blog.hiroshiba.jp/realtime-yukarin-introduction/ https://booth.pm/ja/items/1236505
͓·͚ • GoogleԻ߹ɺԻೝࣝɺ༁APIͷPythonϥούʔ ⇒ gTTS, SpeechRecognition, googletrans • ͦΕͧΕΛ͓ࢼ͠ͰͬͯΈΔʹे •
ࢀߟ GoogleԻೝࣝͷ݁ՌΛGoogle༁͠ɺGoogle Text-to-SpeechͰ Իʹ͢PythonεΫϦϓτʢฐϒϩάʣ https://tam5917.hatenablog.com/entry/2019/04/28/191946 49
͓ΘΓʹ • Իใॲཧͷʢʣೖ Իʹؚ·ΕΔใʢݴޠɾύϥݴޠɾඇݴޠʣΛॲཧ ⇒ Ի߹Իೝࣝɺऀ/ײೝࣝɺԻม ͳͲ • Իมٕज़ͷհ ϘΠενΣϯδϟʔͷΈɿԻͷੜաఔͷϞσϧԽ͕جૅ
• ศརͳʢPythonʣύοέʔδΛհ 50 ԻใॲཧɾԻมٕज़ʹ ڵຯΛ࣋ͬͯΒ͑ͨΒ͍Ͱ͢