Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
【動画あり】Transformer論文解説
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
数理の弾丸
July 16, 2024
Technology
370
0
Share
【動画あり】Transformer論文解説
下記YouTube動画で使用したスライド資料です。
https://youtu.be/6tcjwdanedU
数理の弾丸
July 16, 2024
More Decks by 数理の弾丸
See All by 数理の弾丸
Claude Code × autoresearch 実践
mathbullet
0
140
RAG:チャットボットの能力を底上げする技術
mathbullet
0
270
ゼロから始める大規模言語モデル入門
mathbullet
0
250
[動画あり] 線形回帰を題材に汎用的な理解を身につける:座学編
mathbullet
0
120
[動画あり] AI入門特急コース
mathbullet
0
230
Other Decks in Technology
See All in Technology
地元にいないローカルオーガナイザーの立ち回り
uvb_76
1
440
AI-DLCを活用した高品質・安全なAI駆動開発実践 / AI Driven Development with AI-DLC
yoshidashingo
0
100
Platform Engineering as a Product: Criteria for Improvement and Multi-Tenant Design
kumorn5s
0
480
Gradle×GitHub_ActionsでCI時間を約50%短縮 ジョブ分割の設計と落とし穴 / Cutting CI Time by ~50% with Gradle and GitHub Actions: Job-Splitting Design and Pitfalls
takatty
0
610
APIテストとは?
nagix
0
170
「嘘をつくテスト」の失敗例から学ぶ 良いテストコード #frontend_phpcon_do
asumikam
0
150
Claude code Orchestra
ozakiomumkj
3
920
脅威をエンジニアリングの糧にして:恐怖を乗り越えた先にあったもの / Turn threats into fuel for engineering: what lay beyond overcoming fear
nrslib
1
380
サプライチェーンセキュリティの空白地帯 - 信頼できる”依存性”の未来を考える
rung
PRO
2
650
先取りMaven4 ~16年ぶりのメジャーアップデート、その進化とは?~
ogiwarat
0
140
Unlocking the Apps
pimterry
0
180
AI フレンドリーなエラー監視を TypeScript で実現する
shinyaigeek
2
240
Featured
See All Featured
Scaling GitHub
holman
464
140k
Navigating the Design Leadership Dip - Product Design Week Design Leaders+ Conference 2024
apolaine
1
330
Impact Scores and Hybrid Strategies: The future of link building
tamaranovitovic
0
300
Tell your own story through comics
letsgokoyo
1
940
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
55k
Optimizing for Happiness
mojombo
378
71k
SERP Conf. Vienna - Web Accessibility: Optimizing for Inclusivity and SEO
sarafernandez
2
1.5k
The browser strikes back
jonoalderson
0
1.1k
BBQ
matthewcrist
89
10k
Docker and Python
trallard
47
3.9k
The AI Search Optimization Roadmap by Aleyda Solis
aleyda
1
5.9k
Making Projects Easy
brettharned
120
6.7k
Transcript
ࠓճͷ༰ ࠷ॳͷϖʔδ ༰ղઆ จಡΉͱ͖ʹԿΛߟ͍͑ͯΔ͔ʁ ͦͷޙͷల։ ͜ͷจ୯ମͷཧղʹͱͲ·Βͣ จͷಡΈํɾͰͷҐஔ͚ΛΔ 5SBOTGPSNFSఏҊจΛಡΉ 7BTXBOJ
"TIJTI FUBM"UUFOUJPOJTBMMZPVOFFE"EWBODFTJOOFVSBMJOGPSNBUJPOQSPDFTTJOHTZTUFNT
ͳͥ͜ͷจ͕ॏཁͳͷ͔ʁ
5SBOTGPSNFSͷԠ༻ൣғ ※: https://blog.google/products/search/search-language-understanding-bert/ FUD 5SBOTGPSNFS ςΩετ༁Λओ؟ͱͯ͠ఏҊ #&35 (15 ςΩετྨFUD
ςΩετੜ ෦ΞʔΩςΫνϟͷ࠾༻ ը૾ͷద༻ 7J5 %JGGVTJPO 5SBOTGPSNFS ը૾ྨFUD ը૾ੜ $IBU(15 -MBNB 4UBCMF%JGGVTJPO 4PSB (PPHMFݕࡧ˞ $-*1 ۃΊͯൣғʹج൫ٕज़ͱͯ͠׆༂
ਓೳͷจΛಡΉΓޱ
ਓೳͷจΛಡΉΓޱ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU
ਓೳͷจΛಡΉΓޱ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU *OUSPEVDUJPO #BDLHSPVOE .PEFM"SDIJUFDUVSF
8IZ4FMG"UUFOUJPO 5SBJOJOH 3FTVMUT
จͷߏ ΞϒετϥΫτ ΠϯτϩμΫγϣϯ ؔ࿈ݚڀ ఏҊख๏ ࣮ݧઃఆɾ݁Ռɾٞ ݁
จͷߏ ΞϒετϥΫτ ΠϯτϩμΫγϣϯ ؔ࿈ݚڀ ఏҊख๏ ࣮ݧઃఆɾ݁Ռɾٞ ݁ ⁞֓ཁΛ௫Ή
จͷߏ ΞϒετϥΫτ ΠϯτϩμΫγϣϯ ؔ࿈ݚڀ ఏҊख๏ ࣮ݧઃఆɾ݁Ռɾٞ ݁ ⁞֓ཁΛ௫Ή ओுΛ௫Ή
จͷߏ ΞϒετϥΫτ ΠϯτϩμΫγϣϯ ؔ࿈ݚڀ ఏҊख๏ ࣮ݧઃఆɾ݁Ռɾٞ ݁ ⁞֓ཁΛ௫Ή ओுΛ௫Ή
ॏΈ͚ͯ͠ಡΉ
ਓೳͷจΛಡΉΓޱ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU *OUSPEVDUJPO #BDLHSPVOE .PEFM"SDIJUFDUVSF
8IZ4FMG"UUFOUJPO 5SBJOJOH 3FTVMUT
ܥྻϞσϦϯά
ܥྻϞσϦϯά ॱংͷ͋Δཁૉͷ࿈ͳΓͱΈͳͤΔͷΛܥྻʢTFRVFODFʣͱݺͼɺ ͜ΕΛରͱ͢ΔϞσϦϯάΛܥྻϞσϦϯάͱݺͿ
ܥྻϞσϦϯά ॱংͷ͋Δཁૉͷ࿈ͳΓͱΈͳͤΔͷΛܥྻʢTFRVFODFʣͱݺͼɺ ͜ΕΛରͱ͢ΔϞσϦϯάΛܥྻϞσϦϯάͱݺͿ ྫ ༁ ҙػߏ͑͋͞Εे "UUFOUJPOJTBMMZPVOFFE ྫ ςΩετੜ
Ͳ͏ͧΑΖ͓͘͠ئ͍͠·͢ɻ Կ͔࣭͝ϦΫΤετ͕͋Εڭ͍͑ͯͩ͘͞ɻ ΑΖ͘͠པΉ ྫ ߏจղੳ 4 /1ΑΖ͘͠ 71པΉ ΑΖ͘͠པΉ
ܥྻϞσϦϯά ॱংͷ͋Δཁૉͷ࿈ͳΓͱΈͳͤΔͷΛܥྻʢTFRVFODFʣͱݺͼɺ ͜ΕΛରͱ͢ΔϞσϦϯάΛܥྻϞσϦϯάͱݺͿ ྫ ༁ ҙػߏ͑͋͞Εे "UUFOUJPOJTBMMZPVOFFE ྫ ςΩετੜ
Ͳ͏ͧΑΖ͓͘͠ئ͍͠·͢ɻ Կ͔࣭͝ϦΫΤετ͕͋Εڭ͍͑ͯͩ͘͞ɻ ΑΖ͘͠པΉ ྫ ߏจղੳ 4 /1ΑΖ͘͠ 71པΉ ΑΖ͘͠པΉ TPVSDF
ܥྻϞσϦϯά ॱংͷ͋Δཁૉͷ࿈ͳΓͱΈͳͤΔͷΛܥྻʢTFRVFODFʣͱݺͼɺ ͜ΕΛରͱ͢ΔϞσϦϯάΛܥྻϞσϦϯάͱݺͿ ྫ ༁ ҙػߏ͑͋͞Εे "UUFOUJPOJTBMMZPVOFFE ྫ ςΩετੜ
Ͳ͏ͧΑΖ͓͘͠ئ͍͠·͢ɻ Կ͔࣭͝ϦΫΤετ͕͋Εڭ͍͑ͯͩ͘͞ɻ ΑΖ͘͠པΉ ྫ ߏจղੳ 4 /1ΑΖ͘͠ 71པΉ ΑΖ͘͠པΉ UBSHFU
ॏཁ՝ɿڑґଘͷཧղ ൴͕ॻ͍ͨͦͷຊΛɺࢲҰಡΜͩ͜ͱ͕͋Γ·ͤΜɻ తޠ ओޠ ҐஔతʹΕͨܥྻཁૉؒͷґଘؔ
ॏཁ՝ɿڑґଘͷཧղ ൴͕ॻ͍ͨͦͷຊΛɺࢲҰಡΜͩ͜ͱ͕͋Γ·ͤΜɻ తޠ ओޠ ҐஔతʹΕͨܥྻཁૉؒͷґଘؔ ڑґଘΛѲͰ͖ͳ͍ͱେͷλεΫղ͚ͳ͍
ର߅അԿ͔ *OUSPEVDUJPO#BDLHSPVOE
ର߅അԿ͔ ࠶ؼܕχϡʔϥϧωοτϫʔΫ ΈࠐΈχϡʔϥϧωοτϫʔΫ -45.<)PDISFJUFS > (36<$IVOH > FUD #ZUF/FU<,BMDICSFOOFS
> $POW44<(FISJOH > FUD ܥྻͷཁૉΛॱʹೖྗ͍ͯ͘͠ ฒྻܭࢉ͕Ͱ͖ͳ͍ ཁૉؒڑʹԠͨ͡ܭࢉྔ૿Ճ͕ݦஶ ڑґଘͷֶश͕ࠔ
ର߅അԿ͔ ࠶ؼܕχϡʔϥϧωοτϫʔΫ ΈࠐΈχϡʔϥϧωοτϫʔΫ -45.<)PDISFJUFS > (36<$IVOH > FUD #ZUF/FU<,BMDICSFOOFS
> $POW44<(FISJOH > FUD ܥྻͷཁૉΛॱʹೖྗ͍ͯ͘͠ ฒྻܭࢉ͕Ͱ͖ͳ͍ ཁૉؒڑʹԠͨ͡ܭࢉྔ૿Ճ͕ݦஶ ڑґଘͷֶश͕ࠔ ฒྻԽ͕Մೳ͔ͭڑґଘΛֶशͰ͖ΔϞσϧͱͯ͠ 5SBOTGPSNFSΛఏҊʢ4FD ʣ
طଘݚڀ͔ΒҾ͖ܧ͙ͷ Τϯίʔμɾσίʔμػߏ FODPEFSEFDPEFSNFDIBOJTN ࣗݾҙػߏ TFMGBUUFOUJPONFDIBOJTN Τϯίʔμ σίʔμ ೖྗ ग़ྗ
ಛநग़ɾܥྻੜͷೋஈߏ͑ ࢲ ٢ా ࢲ ٢ా ࣗܥྻؒͰͷॏΈ͚
طଘݚڀ͔ΒҾ͖ܧ͙ͷ Τϯίʔμɾσίʔμػߏ FODPEFSEFDPEFSNFDIBOJTN ࣗݾҙػߏ TFMGBUUFOUJPONFDIBOJTN Τϯίʔμ σίʔμ ೖྗ ग़ྗ
ಛநग़ɾܥྻੜͷೋஈߏ͑ ࢲ ٢ా ࢲ ٢ా ࣗܥྻؒͰͷॏΈ͚ Τϯίʔμɾσίʔμͷ༗༻ੑΛੜ͔ͭͭ͠ ࣗݾҙػߏͰ݁͢ΔॳΊͯͷϞσϧʢ4FDʣ
·ͱΊ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU ࠶ؼܕωοτϫʔΫ ΈࠐΈωοτϫʔΫ ࣗݾҙͰ݁ͨ͠ ΤϯίʔμɾσίʔμΛఏҊ w
ฒྻԽ͕༰қ w ڑґଘΛଊ͑Δ .PEFM"SDIJUFDUVSF 8IZ4FMG"UUFOUJPO 5SBJOJOH 3FTVMUT
͜͜·ͰಡΉͱओ؟͕Θ͔Δ ฒྻԽՄೳͰɺ͔ͭڑͷґଘؔΛ ଊ͑ΒΕΔϝΧχζϜͱʁ
ओఏҊԿ͔ .PEFM"SDIJUFDUVSF8IZ4FMG"UUFOUJPO
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ Ґஔූ߸Խ Ґஔූ߸Խ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞
Ґஔූ߸Խ ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ
ϚεΫ͖ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ Τϯίʔμ Ґஔූ߸Խ
Ґஔූ߸Խ ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ
ϚεΫ͖ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ σίʔμ Ґஔූ߸Խ
Ґஔූ߸Խ ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ
ϚεΫ͖ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ σίʔμ Ґஔූ߸Խ ࣗݾճؼ BVUPSFHSFTTJPO ࣌ࠁ ͷग़ྗ͕ ͷೖྗʹͳΔػߏ t t + 1
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ Ґஔූ߸Խ Ґஔූ߸Խ
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ Ґஔූ߸Խ Ґஔූ߸Խ
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ ͔͜͜ΒઌͷॲཧೖྗςΩετͷ ޠॱΛೝࣝͰ͖ͳ͍ Ґஔූ߸Խ Ґஔූ߸Խ
ຒΊࠐΈɾҐஔූ߸Խ ς Ω ε τ τ Ϋ ϯ Խ
ࢲ ٢ా ʜ ʜ ʜ ຒΊࠐΈ e1 e2 e3 ࣍ݩͷϕΫτϧ dmodel
ຒΊࠐΈɾҐஔූ߸Խ ς Ω ε τ τ Ϋ ϯ Խ ࢲ
٢ా ʜ ʜ ʜ ຒΊࠐΈ e1 e2 e3 ʜ ʜ ʜ Ґஔූ߸Խ p1 p2 p3 ppos [2i] = sin ( pos 10000 2i dmodel ) ppos [2i + 1] = cos ( pos 10000 2i dmodel )
ຒΊࠐΈɾҐஔූ߸Խ ς Ω ε τ τ Ϋ ϯ Խ
ࢲ ٢ా ʜ ʜ ʜ ຒΊࠐΈ e1 e2 e3 ʜ ʜ ʜ Ґஔූ߸Խ p1 p2 p3 ppos [2i] = sin ( pos 10000 2i dmodel ) ppos [2i + 1] = cos ( pos 10000 2i dmodel ) x1 x2 x3 = de1 + p1 = de2 + p2 = de3 + p3
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ ࣗݾҙ TFMGBUUFOUJPO w ϕΫτϧྻΛจ຺Λߟྀ͠ͳ͕Βม w 5SBOTGPSNFSͷ࠷ॏཁͳ෦ Ґஔූ߸Խ Ґஔූ߸Խ
ࣗݾҙ x1 x2 x3 Q = [ q1 ]
[ q2 ] [ q3 ] K = [ k1 ] [ k2 ] [ k3 ] V = [ v1 ] [ v2 ] [ v3 ] qi = Wq xi , Wq ∈ ℝdk ×dmodel ki = Wk xi , Wk ∈ ℝdk ×dmodel vi = Wv xi , Wv ∈ ℝdv ×dmodel
ࣗݾҙ x1 x2 x3 Q = [ q1 ]
[ q2 ] [ q3 ] K = [ k1 ] [ k2 ] [ k3 ] V = [ v1 ] [ v2 ] [ v3 ] h1 h2 h3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ͜Ε͔Βܭࢉ͍ͨ͠ͷ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K
ࣗݾҙ h1 h2 h3 = ◯v1 + ◯v2 +
◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ͜Ε͔Βܭࢉ͍ͨ͠ͷ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K QK⊤ = [ q1 ] [ q2 ] [ q3 ] [ k1 ] [ k2 ] [ k3 ] = q1 ⋅ k1 q1 ⋅ k2 q1 ⋅ k3 q2 ⋅ k1 q2 ⋅ k2 q2 ⋅ k3 q3 ⋅ k1 q3 ⋅ k2 q3 ⋅ k3
ࣗݾҙ h1 h2 h3 = ◯v1 + ◯v2 +
◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ͜Ε͔Βܭࢉ͍ͨ͠ͷ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K QK⊤ = [ q1 ] [ q2 ] [ q3 ] [ k1 ] [ k2 ] [ k3 ] = q1 ⋅ k1 q1 ⋅ k2 q1 ⋅ k3 q2 ⋅ k1 q2 ⋅ k2 q2 ⋅ k3 q3 ⋅ k1 q3 ⋅ k2 q3 ⋅ k3 ʮ٢ాʯ͔Βݟͨʮࢲʯͷॏཁ
ࣗݾҙ h1 h2 h3 = ◯v1 + ◯v2 +
◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ͜Ε͔Βܭࢉ͍ͨ͠ͷ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K QK⊤ = [ q1 ] [ q2 ] [ q3 ] [ k1 ] [ k2 ] [ k3 ] = q1 ⋅ k1 q1 ⋅ k2 q1 ⋅ k3 q2 ⋅ k1 q2 ⋅ k2 q2 ⋅ k3 q3 ⋅ k1 q3 ⋅ k2 q3 ⋅ k3 ʮ٢ాʯ͔Βݟͨʮʯͷॏཁ
ࣗݾҙ h1 h2 h3 = ◯v1 + ◯v2 +
◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ͜Ε͔Βܭࢉ͍ͨ͠ͷ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K QK⊤ = [ q1 ] [ q2 ] [ q3 ] [ k1 ] [ k2 ] [ k3 ] = q1 ⋅ k1 q1 ⋅ k2 q1 ⋅ k3 q2 ⋅ k1 q2 ⋅ k2 q2 ⋅ k3 q3 ⋅ k1 q3 ⋅ k2 q3 ⋅ k3 ʮ٢ాʯ͔Βݟͨʮ٢ాʯͷॏཁ
ࣗݾҙ h1 h2 h3 = ◯v1 + ◯v2 +
◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ͜Ε͔Βܭࢉ͍ͨ͠ͷ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K softmax ( QK⊤ dk ) = a11 a12 a13 a21 a22 a23 a31 a32 a33 εέʔϦϯά ิ ͜ͷߦྻΛҙߦྻ BUUFOUJPONBUSJY ͱݺͿ ֤ ҙॏΈ BUUFOUJPOXFJHIU ͱݺͿ aij
ࣗݾҙ h1 h2 h3 = a21 v1 + a22
v2 + a23 v3 ࣗݾҙͷ࠷ऴग़ྗ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K softmax ( QK⊤ dk ) = a11 a12 a13 a21 a22 a23 a31 a32 a33 εέʔϦϯά = a11 v1 + a12 v2 + a13 v3 = a31 v1 + a32 v2 + a33 v3
ࣗݾҙ h1 h2 h3 = a21 v1 + a22
v2 + a23 v3 ࣗݾҙͷ࠷ऴग़ྗ पลจ຺ͷࠞͥ߹Θͤ۩߹Λ˓ͷ͕ܾΊΔ ͜ΕΛ Λ༻͍ͯٻΊΔ Q, K softmax ( QK⊤ dk ) = a11 a12 a13 a21 a22 a23 a31 a32 a33 εέʔϦϯά = a11 v1 + a12 v2 + a13 v3 = a31 v1 + a32 v2 + a33 v3 ࣗݾҙ ʹपลจ຺Λߟྀ͢Δػߏ
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ Ґஔූ߸Խ Ґஔූ߸Խ
ϚεΫ͖ࣗݾҙ softmax ( QK⊤ dk ) = a11 a12
a13 a21 a22 a23 a31 a32 a33
ϚεΫ͖ࣗݾҙ softmax ( QK⊤ dk ) = a11 a12
a13 a21 a22 a23 a31 a32 a33 [ 1 0 0 1 1 0 1 1 1 ] ϚεΫߦྻ
ϚεΫ͖ࣗݾҙ softmax ( QK⊤ dk ) = a11 a12
a13 a21 a22 a23 a31 a32 a33 [ 1 0 0 1 1 0 1 1 1 ] ϚεΫߦྻ a11 a12 a13 a21 a22 a23 a31 a32 a33 ⊙ [ 1 0 0 1 1 0 1 1 1 ] = a11 0 0 a21 a22 0 a31 a32 a33
ϚεΫ͖ࣗݾҙ h1 h2 h3 = a21 v1 + a22
v2 + 0v3 ࣗݾҙͷ࠷ऴग़ྗ softmax ( QK⊤ dk ) = a11 a12 a13 a21 a22 a23 a31 a32 a33 = a11 v1 + 0v2 + 0v3 = a31 v1 + a32 v2 + a33 v3 [ 1 0 0 1 1 0 1 1 1 ] ϚεΫߦྻ a11 a12 a13 a21 a22 a23 a31 a32 a33 ⊙ [ 1 0 0 1 1 0 1 1 1 ] = a11 0 0 a21 a22 0 a31 a32 a33
ࣗݾҙʹ͍ͭͯཧ
ࣗݾҙ h1 h2 h3 = ◯v1 + ◯v2 +
◯v3 = ◯v1 + ◯v2 + ◯v3 = ◯v1 + ◯v2 + ◯v3 ܭࢉ͍ͨ͠ͷ QK⊤ = [ q1 ] [ q2 ] [ q3 ] [ k1 ] [ k2 ] [ k3 ] = q1 ⋅ k1 q1 ⋅ k2 q1 ⋅ k3 q2 ⋅ k1 q2 ⋅ k2 q2 ⋅ k3 q3 ⋅ k1 q3 ⋅ k2 q3 ⋅ k3 softmax ( QK⊤ dk ) = a11 a12 a13 a21 a22 a23 a31 a32 a33
Ϛϧνϔουࣗݾҙ 'JHΑΓൈਮ Λ ׂͯ͠ฒྻॲཧ ग़ྗΛܨ͛ͯͻͱͭʹ͢Δ Q, K, V h
จͰ Ͱ࣮ݧ h = 1,4,8,16,32
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ Ґஔූ߸Խ Ґஔූ߸Խ
1PTJUJPOXJTF'FFE'PSXBSE/FUXPSLT ࣗ ݾ ҙ ࢲ ٢ా ʜ
ʜ ʜ จ຺ԽຒΊࠐΈ h1 h2 h3 ReLU(h1 W1 + b1 )W2 + b2 ReLU(h2 W1 + b1 )W2 + b2 ReLU(h3 W1 + b1 )W2 + b2 ϕΫτϧͦΕͧΕʹରͯ͠'FFEGPSXBSE/FUXPSLΛద༻
1PTJUJPOXJTF'FFE'PSXBSE/FUXPSLT ࣗ ݾ ҙ ࢲ ٢ా ʜ
ʜ ʜ จ຺ԽຒΊࠐΈ h1 h2 h3 ReLU(h1 W1 + b1 )W2 + b2 ReLU(h2 W1 + b1 )W2 + b2 ReLU(h3 W1 + b1 )W2 + b2 ϕΫτϧͦΕͧΕʹରͯ͠'FFEGPSXBSE/FUXPSLΛద༻ 5SBOTGPSNFSʹ͓͚Δ''/ͷׂʹ͍ͭͯͦͷޙ͞·͟·ͳ͕ٞ͋Δ w (FWB .PS FUBM5SBOTGPSNFSGFFEGPSXBSEMBZFSTBSFLFZWBMVFNFNPSJFTBS9JWQSFQSJOUBS9JW w ;IBOH ;IFOHZBO FUBM.PF fi DBUJPO5SBOTGPSNFSGFFEGPSXBSEMBZFSTBSFNJYUVSFTPGFYQFSUTBS9JWQSFQSJOUBS9JW w (FWB .PS FUBM5SBOTGPSNFSGFFEGPSXBSEMBZFSTCVJMEQSFEJDUJPOTCZQSPNPUJOHDPODFQUTJOUIFWPDBCVMBSZTQBDFBS9JWQSFQSJOUBS9JW w FUD
ఏҊϞσϧͷΞʔΩςΫνϟ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ 'FFE'PSXBSE /FUXPSL Ճࢉਖ਼نԽ ϚεΫ͖
Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ϛϧνϔου ࣗݾҙ Ճࢉਖ਼نԽ Ճࢉਖ਼نԽ ೖྗςΩετ ग़ྗςΩετ 'FFE'PSXBSE /FUXPSL ઢܗม ιϑτϚοΫεؔ ֬ 🌟 🌟 🌟 /ʷ ʷ/ ˞ਤ7BTXBOJ ͷ'JHVSFΛϕʔεͱͯ͠࡞ Ґஔූ߸Խ Ґஔූ߸Խ
·ͱΊ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU ࠶ؼܕωοτϫʔΫ ΈࠐΈωοτϫʔΫ ࣗݾҙͰ݁ͨ͠ ΤϯίʔμɾσίʔμΛఏҊ w
ฒྻԽ͕༰қ w ڑґଘΛଊ͑Δ 5SBJOJOH 3FTVMUT 5SBOTGPSNFSͷػߏ Ґஔූ߸Խ
ධՁͷ8IBU3FTVMU 5SBJOJOH3FTVMUT
8IBU λεΫ ػց༁ σʔληοτ 8.5 w χϡʔε༁ͷֶशɾධՁσʔληοτ w FOEFNJMMJPOTFOUFODFQBJST
w FOGSNJMMJPOTFOUFODFQBJST ධՁࢦඪ w #-&6ʢ༁ੑೳʣ w '-01Tʢܭࢉྔʣ
3FTVMU ଞϞσϧʹඖఢ͢ΔੑೳΛΑΓগͳֶ͍शίετͰ࣮ݱ
·ͱΊ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU ࠶ؼܕωοτϫʔΫ ΈࠐΈωοτϫʔΫ ࣗݾҙͰ݁ͨ͠ ΤϯίʔμɾσίʔμΛఏҊ w
ฒྻԽ͕༰қ w ڑґଘΛଊ͑Δ 5SBOTGPSNFSͷػߏ Ґஔූ߸Խ 8IBUػց༁ 3FTVMU405" ଞϞσϧʹඖఢ͢ΔੑೳΛ ΑΓগͳֶ͍शίετͰ࣮ݱ
ࠓճͷ༰ ࠷ॳͷϖʔδ ༰ղઆ จಡΉͱ͖ʹԿΛߟ͍͑ͯΔ͔ʁ ͦͷޙͷల։ ͜ͷจ୯ମͷཧղʹͱͲ·Βͣ จͷಡΈํɾͰͷҐஔ͚ΛΔ 5SBOTGPSNFSఏҊจΛಡΉ 7BTXBOJ
"TIJTI FUBM"UUFOUJPOJTBMMZPVOFFE"EWBODFTJOOFVSBMJOGPSNBUJPOQSPDFTTJOHTZTUFNT
ͦͷޙͷల։
ੜϞσϧͷੜ Τϯίʔμͱσίʔμׂ͕ҟͳΔ Τϯίʔμ σίʔμ ೖྗܥྻͷ$POUFYUVBMJ[BUJPO ࣗݾճؼతͳܥྻੜ
ੜϞσϧͷੜ Τϯίʔμͱσίʔμׂ͕ҟͳΔ Τϯίʔμ σίʔμ ೖྗܥྻͷ$POUFYUVBMJ[BUJPO ࣗݾճؼతͳܥྻੜ ಛநग़ثͱͯ͠ͷ׆༻ #&357J5FUD ੜϞσϧͱͯ͠ͷ׆༻
(15-MBNBFUD ͦΕͧΕΛϕʔεͱͨ͠৽ͨͳϞσϧ͕ੜ
ֶशύϥμΠϜͷมભ ݱࡏ εΫϥονֶश ϑΝΠϯνϡʔχϯά *ODPOUFYUMFBSOJOH ಛఆͷλεΫʹಛԽͨ͠ϞσϧΛ ϥϯμϜͳΛͱΔύϥϝλ͔Βֶश
ࣄલֶशࡁΈϞσϧΛ ݸผλεΫ͚ʹඍௐ ϞσϧͦͷͷΛௐͤͣ ࢦࣔʹै༷ͬͯʑͳλεΫΛ͜ͳ͢ w ࠶ؼܕωοτϫʔΫ w ΈࠐΈωοτϫʔΫ w 5SBOTGPSNFS w #&35 w (15 w 3FT/FU w (15 w -MBNB w 1B-.
ֶशύϥμΠϜͷมભ ݱࡏ εΫϥονֶश ϑΝΠϯνϡʔχϯά *ODPOUFYUMFBSOJOH ಛఆͷλεΫʹಛԽͨ͠ϞσϧΛ ϥϯμϜͳΛͱΔύϥϝλ͔Βֶश
ࣄલֶशࡁΈϞσϧΛ ݸผλεΫ͚ʹඍௐ ϞσϧͦͷͷΛௐͤͣ ࢦࣔʹै༷ͬͯʑͳλεΫΛ͜ͳ͢ w ࠶ؼܕωοτϫʔΫ w ΈࠐΈωοτϫʔΫ w 5SBOTGPSNFS w #&35 w (15 w 3FT/FU w (15 w -MBNB w 1B-. #&35ʹΑΔϑΝΠϯνϡʔχϯά(15ʹΑΔ*ODPOUFYUMFBSOJOH ͕ಛʹΤϙοΫϝΠΩϯά
ࣗݾҙͰදݱ͞Ε͍ͯΔࣝͱʁ Ϟσϧ͕ͲͷΑ͏ͳࣝΛ͍࣋ͬͯΔ͔Λௐࠪ͢ΔݚڀΛ ϓϩʔϏϯά QSPCJOH ͱݺͿ
ࣗݾҙͰදݱ͞Ε͍ͯΔࣝͱʁ Ϟσϧ͕ͲͷΑ͏ͳࣝΛ͍࣋ͬͯΔ͔Λௐࠪ͢ΔݚڀΛ ϓϩʔϏϯά QSPCJOH ͱݺͿ #&35ͷ࡞ΔຒΊࠐΈ͔ΒΓड͚ߏ͕͓͓ΉͶநग़Ͱ͖Δʢࠇ͕ਖ਼ղɺ੨͕#&35͔Βநग़ͨ͠Γड͚ߏʣ<)FXJUU 'JH> ໌ࣔతʹֶश͍ͯ͠ͳ͍ࣝͷ֫ಘՄೳੑ<$MBSL
'JH>
ςΩετΛ͑ͨ׆༂ %PTPWJUTLJZ 'JH 3BEGPSE 'JH
7JTJPO5SBOTGPSNFS $-*1 ࣗݾҙͷՄೳੑΛ୳Δޙଓݚڀ͕ଟൃ
·ͱΊ ର߅അԿ͔ ओఏҊԿ͔ ධՁͷ8IBU3FTVMU ࠶ؼܕωοτϫʔΫ ΈࠐΈωοτϫʔΫ ࣗݾҙͰ݁ͨ͠ ΤϯίʔμɾσίʔμΛఏҊ w
ฒྻԽ͕༰қ w ڑґଘΛଊ͑Δ 5SBOTGPSNFSͷػߏ Ґஔූ߸Խ 8IBUػց༁ 3FTVMU405" ଞϞσϧʹඖఢ͢ΔੑೳΛ ΑΓগͳֶ͍शίετͰ࣮ݱ
จͷߏ ΞϒετϥΫτ ΠϯτϩμΫγϣϯ ؔ࿈ݚڀ ఏҊख๏ ࣮ݧઃఆɾ݁Ռɾٞ ݁ ⁞֓ཁΛ௫Ή ओுΛ௫Ή
ॏΈ͚ͯ͠ಡΉ
ࠓճͷ༰ ࠷ॳͷϖʔδ ༰ղઆ จಡΉͱ͖ʹԿΛߟ͍͑ͯΔ͔ʁ ͦͷޙͷల։ ͜ͷจ୯ମͷཧղʹͱͲ·Βͣ จͷಡΈํɾͰͷҐஔ͚ΛΔ 5SBOTGPSNFSఏҊจΛಡΉ 7BTXBOJ
"TIJTI FUBM"UUFOUJPOJTBMMZPVOFFE"EWBODFTJOOFVSBMJOGPSNBUJPOQSPDFTTJOHTZTUFNT