Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
修論発表.pdf
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Hayato Tsukagoshi
September 29, 2024
170
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
修論発表.pdf
修論発表会にて使用した発表スライドです。
Hayato Tsukagoshi
September 29, 2024
More Decks by Hayato Tsukagoshi
See All by Hayato Tsukagoshi
【輪講資料】Moshi: a speech-text foundation model for real-time dialogue
hpprc
3
1.3k
Word Embeddings Are Steers for Language Models
hpprc
1
320
NLP2024 招待論文セッション: 定義文を用いた文埋め込み構成法
hpprc
1
190
YANS2024: 目指せ国際会議!「あぶない国際会議」
hpprc
0
350
Isotropy, Clusters, and Classifiers
hpprc
3
1k
[輪講資料] Matryoshka Representation Learning
hpprc
5
2.8k
[輪講資料] Text Embeddings by Weakly-Supervised Contrastive Pre-training
hpprc
4
1.5k
[輪講資料] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
hpprc
1
1.2k
WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings
hpprc
3
980
Featured
See All Featured
Conquering PDFs: document understanding beyond plain text
inesmontani
PRO
4
2.8k
Building AI with AI
inesmontani
PRO
1
1.1k
The Spectacular Lies of Maps
axbom
PRO
1
820
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
Ruling the World: When Life Gets Gamed
codingconduct
0
260
Darren the Foodie - Storyboard
khoart
PRO
3
3.4k
エンジニアに許された特別な時間の終わり
watany
107
250k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
659
62k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.8k
AI Search: Where Are We & What Can We Do About It?
aleyda
0
7.6k
A Soul's Torment
seathinner
6
3k
WCS-LA-2024
lcolladotor
0
660
Transcript
ҟͳΔڭࢣ৴߸͔Βߏஙͨ͠ จϕΫτϧͷൺֱͱ౷߹ ਖ਼ࢦಋڭһ: ా ߒҰ ෭ࢦಋڭһ: ྒྷฏ 252106192 ௩ӽ
ॣ
จϕΫτϧ: ࣗવݴޠจͷϕΫτϧදݱ 2 จϕΫτϧۭؒ ͜Ͳ͕Ոʹ͔͍ͬͯΔɻ ͜Ͳֶ͕ߍ͔ΒՈʹ͔͍ͬͯΔɻ ͜Ͳ͕ਤॻؗʹ͍Δɻ ͜Ͳ͕ޕޙʹา͍͍ͯΔɻ จಉ࢜ͷҙຯ͕͍ۙ
ϕΫτϧಉ͕͍࢜ۙ
จϕΫτϧ: ࣗવݴޠจͷϕΫτϧදݱ 3 จϕΫτϧۭؒ ͜Ͳ͕Ոʹ͔͍ͬͯΔɻ ͜Ͳֶ͕ߍ͔ΒՈʹ͔͍ͬͯΔɻ ͜Ͳ͕ਤॻؗʹ͍Δɻ ͜Ͳ͕ޕޙʹา͍͍ͯΔɻ จಉ࢜ͷҙຯ͕͍ۙ
ϕΫτϧಉ͕͍࢜ۙ
จϕΫτϧ: ࣗવݴޠจͷϕΫτϧදݱ 4 จϕΫτϧۭؒ ͜Ͳ͕Ոʹ͔͍ͬͯΔɻ ͜Ͳֶ͕ߍ͔ΒՈʹ͔͍ͬͯΔɻ ͜Ͳ͕ਤॻؗʹ͍Δɻ ͜Ͳ͕ޕޙʹา͍͍ͯΔɻ จಉ࢜ͷҙຯ͕͍ۙ
ϕΫτϧಉ͕͍࢜ۙ • ྨࣅจݕࡧ • ΫϥελϦϯά ɹͳͲ෯͍Ԡ༻
จϕΫτϧ: ࣗવݴޠจͷϕΫτϧදݱ 5 จϕΫτϧۭؒ ͜Ͳ͕Ոʹ͔͍ͬͯΔɻ ͜Ͳֶ͕ߍ͔ΒՈʹ͔͍ͬͯΔɻ ͜Ͳ͕ਤॻؗʹ͍Δɻ ͜Ͳ͕ޕޙʹา͍͍ͯΔɻ จಉ࢜ͷҙຯ͕͍ۙ
ϕΫτϧಉ͕͍࢜ۙ • ྨࣅจݕࡧ • ΫϥελϦϯά ɹͳͲ෯͍Ԡ༻ • ϕΫτϧͷ্࣭ • ϕΫτϧͷੑ࣭ཧղ ɹ͕༗༻ੑ্ʹ݁ จϕΫτϧʹ͍ͭͯͷཧղΛਂΊΔ ͜ͱ͕ࠓޙͷൃలͷͨΊʹॏཁ
طଘݚڀͷ՝ จϕΫτϧݚڀͷݱঢ় •Ұ෦ͷϕϯνϚʔΫλεΫͰͷੑೳධՁ͕ओ •ͦΕͧΕͷख๏ͷཧղͦ͜·ͰਐΜͰ͍ͳ͍ 6
طଘݚڀͷ՝ จϕΫτϧݚڀͷݱঢ় •Ұ෦ͷϕϯνϚʔΫλεΫͰͷੑೳධՁ͕ओ •ͦΕͧΕͷख๏ͷཧղͦ͜·ͰਐΜͰ͍ͳ͍ ࣮Ԡ༻ͱͷΪϟοϓ •จϕΫτϧʹٻΊΒΕΔੑ࣭Ԡ༻ࣄྫ͝ͱʹมԽ͢Δ • ݕࡧͰจҙΑΓτϐοΫΧςΰϦ͕ॏཁ • ࣭ԠͰ࣭ͱճ͕ۙ͘ʹ͢Δ͜ͱ͕ॏཁ
•Ԡ༻ࣄྫ͝ͱʹదͳੑ࣭Λ࣋ͭจϕΫτϧΛ͍͍ͨ •ʮख๏͝ͱʹͲͷΑ͏ͳੑ࣭͕͋Δ͔ʁʯۃΊͯॏཁͳ͍ 7
ڭࢣ৴߸ʹணͨ͠จϕΫτϧͷੳ •ػցֶशϞσϧͷৼΔ͍܇࿅σʔλɾख๏ʹେ͖͘ґଘ •ҟͳΔڭࢣ৴߸ʹج͍ͮͯߏங͞ΕͨจϕΫτϧͷੑ࣭Λੳ •ͦΕͧΕͷख๏ʹΑΔจϕΫτϧͷੑ࣭Λੳ 8
ڭࢣ৴߸ʹணͨ͠จϕΫτϧͷੳ •ػցֶशϞσϧͷৼΔ͍܇࿅σʔλɾख๏ʹେ͖͘ґଘ •ҟͳΔڭࢣ৴߸ʹج͍ͮͯߏங͞ΕͨจϕΫτϧͷੑ࣭Λੳ •ͦΕͧΕͷख๏ʹΑΔจϕΫτϧͷੑ࣭Λੳ ੑ࣭ൺֱʹదͨ͠จϕΫτϧख๏ •ྨࣅͨ͠ΞʔΩςΫνϟΛ͕࣋ͭҟͳΔڭࢣ৴߸Λ༻͍Δ •ϕϯνϚʔΫλεΫʹ͓͍ͯಉͷੑೳΛࣔ͢ 9
ڭࢣ৴߸ʹணͨ͠จϕΫτϧͷੳ •ػցֶशϞσϧͷৼΔ͍܇࿅σʔλɾख๏ʹେ͖͘ґଘ •ҟͳΔڭࢣ৴߸ʹج͍ͮͯߏங͞ΕͨจϕΫτϧͷੑ࣭Λੳ •ͦΕͧΕͷख๏ʹΑΔจϕΫτϧͷੑ࣭Λੳ ੑ࣭ൺֱʹదͨ͠จϕΫτϧख๏ •ྨࣅͨ͠ΞʔΩςΫνϟΛ͕࣋ͭҟͳΔڭࢣ৴߸Λ༻͍Δ •ϕϯνϚʔΫλεΫʹ͓͍ͯಉͷੑೳΛࣔ͢ ຊݚڀͰରͱ͢Δख๏ •SBERT: ࣄલֶशࡁΈݴޠϞσϧ
+ ࣗવݴޠਪλεΫ •DefSent: ࣄલֶशࡁΈݴޠϞσϧ + ఆٛจˠ୯ޠ༧ଌλεΫ 10
ࣄલֶशࡁΈݴޠϞσϧ •େنͳςΩετΛ༻͍ͨࣄલֶशʹΑͬͯݴޠࣝΛ֫ಘ •දྫ: BERT, RoBERTa, GPT-2, GPT-3 11 BERTͷ֓ཁਤ
SBERT: ࣗવݴޠਪλεΫʹجͮ͘ख๏ • ࣗવݴޠਪλεΫͰ จϕΫτϧϞσϧΛ܇࿅͢Δख๏ • ࣗવݴޠਪλεΫ: จϖΞͷҙຯؔΛ༧ଌ 12 จB
จA BERT BERT ໃ६ ؚҙ ͦͷଞ ϥϕϧ༧ଌ Pooling Pooling
SBERT: ࣗવݴޠਪλεΫʹجͮ͘ख๏ • ࣗવݴޠਪλεΫͰ จϕΫτϧϞσϧΛ܇࿅͢Δख๏ • ࣗવݴޠਪλεΫ: จϖΞͷҙຯؔΛ༧ଌ SBERTʹΑΔfine-tuningͷखॱ 0.
ࣄલֶशࡁΈݴޠϞσϧΛ༻ҙ 1. จϖΞΛͦΕͧΕจϕΫτϧʹ 2. ಘΒΕͨจϕΫτϧͷϖΞ͔Β จϖΞͷҙຯؔΛ༧ଌ 3. ਖ਼͍͠ҙຯؔΛ༧ଌͰ͖Δ Α͏ʹϞσϧΛ܇࿅ 13 จB จA BERT BERT ໃ६ ؚҙ ͦͷଞ ϥϕϧ༧ଌ Pooling Pooling
DefSent: ఆٛจˠ୯ޠ༧ଌλεΫʹجͮ͘ख๏ • ఆٛจˠ୯ޠ༧ଌλεΫʹΑͬͯ จϕΫτϧϞσϧΛ܇࿅͢Δख๏ 14 ఆٛจ จB จA w
|V| w1 w2 w3 ... BERT BERT ໃ६ ؚҙ ͦͷଞ ϥϕϧ༧ଌ BERT ୯ޠ༧ଌ Pooling Pooling Pooling
DefSent: ఆٛจˠ୯ޠ༧ଌλεΫʹجͮ͘ख๏ • ఆٛจˠ୯ޠ༧ଌλεΫʹΑͬͯ จϕΫτϧϞσϧΛ܇࿅͢Δख๏ DefSentʹΑΔfine-tuningͷखॱ 0. ࣄલֶशࡁΈݴޠϞσϧΛ༻ҙ 1. ఆٛจΛBERTʹೖྗͯ͠
จϕΫτϧΛ֫ಘ 2. ಘΒΕͨϕΫτϧ͔Βఆٛจ ʹରԠ͢Δ୯ޠΛ༧ଌ 3. ఆٛจ͕ද͢୯ޠͷ֬Λ ࠷େԽ͢ΔΑ͏ʹ܇࿅ 15 ఆٛจ จB จA w |V| w1 w2 w3 ... BERT BERT ໃ६ ؚҙ ͦͷଞ ϥϕϧ༧ଌ BERT ୯ޠ༧ଌ Pooling Pooling Pooling
ຊݚڀͷ֓ཁ 16 ڭࢣ৴߸ͷҧ͍ʹ ணͨ͠จϕΫτϧͷ ൺֱɾ౷߹ SBERT DefSent BERT ؚҙؔೝࣝͰ fine-tuning
ఆٛจ→୯ޠ ༧ଌͰfine-tuning จϕΫτϧ Ϟσϧ
ຊݚڀͷ֓ཁ 17 Semantic Textual Similarity (STS) ᶃ จͷιʔε ᶄ จϖΞͷදతྨࣅ
SentEval ᶅ ײɾ੍࣌ྨͳͲͷԼྲྀλεΫ ᶆ ݴޠֶతใͷྨλεΫ ൺֱ •SBERT→DefSent •DefSent→SBERT •ϚϧνλεΫֶश •Average •Concat ౷߹ ڭࢣ৴߸ͷҧ͍ʹ ணͨ͠จϕΫτϧͷ ൺֱɾ౷߹ SBERT DefSent BERT ؚҙؔೝࣝͰ fine-tuning ఆٛจ→୯ޠ ༧ଌͰfine-tuning จϕΫτϧ Ϟσϧ
ҟͳΔڭࢣ৴߸͔Βߏஙͨ͠ จϕΫτϧͷൺֱ
จϕΫτϧͷੑ࣭ൺֱ: STS 19 Semantic Textual Similarity (STS) ᶃ จͷιʔε ᶄ
จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ STSͷධՁखॱ ᶃ จϕΫτϧϞσϧΛ༻ҙ ᶄ จϖΞͦΕͧΕΛจϕΫτϧʹม ᶅ จϕΫτϧͷϖΞͷྨࣅΛܭࢉ ᶆ ਓؒධՁͱͷ૬ؔΛܭࢉ จA จB จϕΫτϧϞσϧ ਓखධՁͱͷ ૬ؔͰධՁ จྨࣅ ᶄ ᶃ ᶅ ᶆ
จϕΫτϧͷੑ࣭ൺֱ: STS 20 Semantic Textual Similarity (STS) ᶃ จͷιʔε ᶄ
จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ
จϕΫτϧͷੑ࣭ൺֱ: STS 21 Semantic Textual Similarity (STS) ᶃ จͷιʔε ᶄ
จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ • 2ͭͷ؍ͰσʔληοτΛׂ • ੑೳͷมԽΛ؍
จϕΫτϧͷੑ࣭ൺֱ: STS 22 Semantic Textual Similarity (STS) ᶃ จͷιʔε ᶄ
จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ • จͷιʔεʹΑͬͯੑೳʹࠩ • ֤ख๏ͷ܇࿅σʔληοτʹ͍ۙ จͷํ͕͏·͘ྨࣅΛଌΕΔ • 2ͭͷ؍ͰσʔληοτΛׂ • ੑೳͷมԽΛ؍
จϕΫτϧͷੑ࣭ൺֱ: STS 23 දతྨࣅͱੑೳͷؔ SBERT DefSent Semantic Textual Similarity (STS)
ᶃ จͷιʔε ᶄ จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ • දతྨࣅʹΑͬͯੑೳࠩ • SBERT (ؚҙؔ)දతྨ ࣅͷӨڹΛड͚ͮΒ͍ • DefSent (ఆٛจ)දతʹྨ ࣅ͍ͯ͠ͳ͍จͷྨࣅΛ ൺֱతਖ਼͘͠ਪఆͰ͖Δ • จͷιʔεʹΑͬͯੑೳʹࠩ • ֤ख๏ͷ܇࿅σʔληοτʹ͍ۙ จͷํ͕͏·͘ྨࣅΛଌΕΔ • 2ͭͷ؍ͰσʔληοτΛׂ • ੑೳͷมԽΛ؍
จϕΫτϧͷੑ࣭ൺֱ: SentEval 24 SentEval ᶅ ԼྲྀλεΫ͝ͱͷੑೳ ᶆ ݴޠֶతใͷྨੑೳ จϕΫτϧ ೖྗ
ൺֱ؍ จϕΫτϧΛೖྗͱ͢Δ ઢܗྨثͷྨੑೳ ධՁࢦඪ จ ྨੑೳ͔Β จຒΊࠐΈͷ࣭ΛධՁ จϕΫτϧϞσϧ ྨث ᶄ ᶃ ᶅ ᶆ SentEvalͷධՁखॱ ᶃ จຒΊࠐΈϞσϧΛ༻ҙ ᶄ ֤จΛจϕΫτϧʹม ᶅ จϕΫτϧΛೖྗͱ͢ΔྨثΛ܇࿅ ᶆ ྨثͷੑೳ͔ΒจϕΫτϧͷ࣭ΛධՁ
จϕΫτϧͷੑ࣭ൺֱ: SentEval 25 SentEval ᶅ ԼྲྀλεΫ͝ͱͷੑೳ ᶆ ݴޠֶతใͷྨੑೳ จϕΫτϧ ೖྗ
ൺֱ؍ จϕΫτϧΛೖྗͱ͢Δ ઢܗྨثͷྨੑೳ ධՁࢦඪ
จϕΫτϧͷੑ࣭ൺֱ: SentEval 26 SentEval ᶅ ԼྲྀλεΫ͝ͱͷੑೳ ᶆ ݴޠֶతใͷྨੑೳ จϕΫτϧ ೖྗ
ൺֱ؍ จϕΫτϧΛೖྗͱ͢Δ ઢܗྨثͷྨੑೳ ධՁࢦඪ • SBERTҙຯతͳใΛ๛ ʹຒΊࠐΜͰ͍Δ • DefSentදతใ͕๛ • ϑϨʔζͷߏಘҙ
Length WordContent Tense SubjNumber จϕΫτϧͷੑ࣭ൺֱ: SentEval 27 SentEval ᶅ ԼྲྀλεΫ͝ͱͷੑೳ
ᶆ ݴޠֶతใͷྨੑೳ จϕΫτϧ ೖྗ ൺֱ؍ จϕΫτϧΛೖྗͱ͢Δ ઢܗྨثͷྨੑೳ ධՁࢦඪ • SBERTҙຯతͳใΛ๛ ʹຒΊࠐΜͰ͍Δ • DefSentදతใ͕๛ • ϑϨʔζͷߏಘҙ • DefSent੍࣌จத୯ޠͷใ ͳͲදతͳใ͕ൺֱత๛ 50 60 70 80 90 Length WordContent Tense จ༧ଌ 50 60 70 80 90 Length WordContent Tense จத୯ޠ༧ଌ ੍࣌༧ଌ • SBERTจͷදใ͕ॏཁͳλεΫ ͷੑೳ͕͍ • จத୯ޠͳͲͷใগͳΊ
จϕΫτϧͷੑ࣭ൺֱ: ·ͱΊ 28 Semantic Textual Similarity (STS) ᶃ จͷιʔε ᶄ
จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ SentEval ᶅ ԼྲྀλεΫ͝ͱͷੑೳ ᶆ ݴޠֶతใͷྨੑೳ จϕΫτϧ ೖྗ ൺֱ؍ จϕΫτϧΛೖྗͱ͢Δ ઢܗྨثͷྨੑೳ ධՁࢦඪ
จϕΫτϧͷੑ࣭ൺֱ: ·ͱΊ 29 දతྨࣅͱੑೳͷؔ SBERT DefSent Semantic Textual Similarity (STS)
ᶃ จͷιʔε ᶄ จϖΞͷදతྨࣅ จϖΞ (จϕΫτϧͷϖΞ) ೖྗ ൺֱ؍ ਓؒධՁͱจϕΫτϧಉ࢜ͷ ྨࣅͱͷॱҐ૬ؔ ධՁࢦඪ SentEval ᶅ ԼྲྀλεΫ͝ͱͷੑೳ ᶆ ݴޠֶతใͷྨੑೳ จϕΫτϧ ೖྗ ൺֱ؍ จϕΫτϧΛೖྗͱ͢Δ ઢܗྨثͷྨੑೳ ධՁࢦඪ SBERT • λεΫ: ࣗવݴޠਪ • ײۃੑͳͲҙຯతใ͕๛ • දతใগͳΊ DefSent • λεΫ: ఆٛจˠ୯ޠ༧ଌ • จ੍࣌ͳͲදతใ͕๛ • ϑϨʔζͷߏಘҙ
ҟͳΔڭࢣ৴߸͔Βߏஙͨ͠ จϕΫτϧͷ౷߹
จϕΫτϧͷ౷߹ ੑ࣭ൺֱͷ݁Ռ •ಉ͡ϞσϧΛϕʔεͱ͠ɺҟͳΔڭࢣ৴߸Λ༻͍Δख๏Λൺֱ • SBERTͱDefSentҟͳΔੑ࣭Λ࣋ͭ͜ͱ͕Θ͔ͬͨ •ҟͳΔৼΔ͍Λ͢ΔϞσϧͷΞϯαϯϒϧ༗ͳखཱͯ • จϕΫτϧʹ͓͍ͯҟͳΔϞσϧͷ౷߹༗༻͔ʁ 31
จϕΫτϧͷ౷߹ ੑ࣭ൺֱͷ݁Ռ •ಉ͡ϞσϧΛϕʔεͱ͠ɺҟͳΔڭࢣ৴߸Λ༻͍Δख๏Λൺֱ • SBERTͱDefSentҟͳΔੑ࣭Λ࣋ͭ͜ͱ͕Θ͔ͬͨ •ҟͳΔৼΔ͍Λ͢ΔϞσϧͷΞϯαϯϒϧ༗ͳखཱͯ • จϕΫτϧʹ͓͍ͯҟͳΔϞσϧͷ౷߹༗༻͔ʁ ҟͳΔੑ࣭Λ࣋ͭจϕΫτϧͷ౷߹ •5ͭͷ౷߹ख๏ʹ͍࣮ͭͯݧ
• S+D, D+S • Multi • Average • Concat 32 • ౷߹ʹΑͬͯੑೳ্͢Δ͔ʁ • ࠓޙͷจϕΫτϧݚڀʹ͓͍ͯෳ ͷڭࢣ৴߸ͷΈ߹Θͤ༗͔ʁ
จϕΫτϧͷ౷߹: ୯ҰϞσϧͷ౷߹ 33 • S+D: ୯ҰϞσϧʹSBERT, DefSentʹΑΔfine-tuningΛॱʹ࣮ࢪ • D+S: ୯ҰϞσϧʹDefSent,
SBERTʹΑΔfine-tuningΛॱʹ࣮ࢪ • Multi: SBERTͱDefSentʹΑΔfine-tuningΛަޓʹ࣮ࢪ
จϕΫτϧͷ౷߹: ୯ҰϞσϧͷ౷߹ 34 • S+D: ୯ҰϞσϧʹSBERT, DefSentʹΑΔfine-tuningΛॱʹ࣮ࢪ • D+S: ୯ҰϞσϧʹDefSent,
SBERTʹΑΔfine-tuningΛॱʹ࣮ࢪ • Multi: SBERTͱDefSentʹΑΔfine-tuningΛަޓʹ࣮ࢪ
จϕΫτϧͷ౷߹: ୯ҰϞσϧͷ౷߹ 35 • S+D: ୯ҰϞσϧʹSBERT, DefSentʹΑΔfine-tuningΛॱʹ࣮ࢪ • D+S: ୯ҰϞσϧʹDefSent,
SBERTʹΑΔfine-tuningΛॱʹ࣮ࢪ • Multi: SBERTͱDefSentʹΑΔfine-tuningΛަޓʹ࣮ࢪ
จϕΫτϧͷ౷߹: ෳϞσϧͷ౷߹ 36 • Average: SBERT, DefSentʹΑΔจϕΫτϧΛฏۉ • Concat: SBERT,
DefSentʹΑΔจϕΫτϧΛ࿈݁
จϕΫτϧͷ౷߹: ෳϞσϧͷ౷߹ 37 • Average: SBERT, DefSentʹΑΔจϕΫτϧΛฏۉ • Concat: SBERT,
DefSentʹΑΔจϕΫτϧΛ࿈݁
จϕΫτϧͷ౷߹: ධՁ࣮ݧ 38 •౷߹ख๏͝ͱʹϞσϧΛ܇࿅ɾධՁ •STSͰ10ճ, SentEvalͰ3ճϞσϧΛ ܇࿅ͯ͠ฏۉੑೳΛใࠂ ࣮ݧઃఆ ධՁର •SBERT
•DefSent •S+D (SBERT→DefSent) •D+S (DefSent→SBERT) •Multi •Average •Concat ୯Ұख๏ͱੑೳΛൺֱ ධՁλεΫ •STS •SentEval
จϕΫτϧͷ౷߹: ධՁ࣮ݧ 39 BERT-base STS SentEval SBERT 73.19 86.49 DefSent
75.20 86.61 S+D 78.45 86.80 D+S 72.89 86.09 Multi 72.89 86.23 Average 77.82 87.47 Concat 76.03 87.93 ֤౷߹ख๏͝ͱͷSTSͱSentEvalͷฏۉੑೳ (%) SBERT→DefSentͱ Average͕ߴੑೳ • SentEvalͰConcatͷੑೳ͕ ྑ͍͕ɺจϕΫτϧͷ࣍ݩ͕ େ͖͘༗རͳͷͰҙ
จϕΫτϧͷ౷߹: ධՁ࣮ݧ 40 BERT-base STS SentEval SBERT 73.19 86.49 DefSent
75.20 86.61 S+D 78.45 86.80 D+S 72.89 86.09 Multi 72.89 86.23 Average 77.82 87.47 Concat 76.03 87.93 ֤౷߹ख๏͝ͱͷSTSͱSentEvalͷฏۉੑೳ (%) • ౷߹ख๏͕୯Ұख๏ ΛԼճΔ߹ • ഁ໓త٫ͷӨڹ͔ SBERT→DefSentͱ Average͕ߴੑೳ DefSent→SBERT ੑೳ͕ѱԽ • SentEvalͰConcatͷੑೳ͕ ྑ͍͕ɺจϕΫτϧͷ࣍ݩ͕ େ͖͘༗རͳͷͰҙ
จϕΫτϧͷ౷߹: ධՁ࣮ݧ 41 BERT-base STS SentEval SBERT 73.19 86.49 DefSent
75.20 86.61 S+D 78.45 86.80 D+S 72.89 86.09 Multi 72.89 86.23 Average 77.82 87.47 Concat 76.03 87.93 ֤౷߹ख๏͝ͱͷSTSͱSentEvalͷฏۉੑೳ (%) • SentEvalͰConcatͷੑೳ͕ ྑ͍͕ɺจϕΫτϧͷ࣍ݩ͕ େ͖͘༗རͳͷͰҙ • ౷߹ख๏͕୯Ұख๏ ΛԼճΔ߹ • ഁ໓త٫ͷӨڹ͔ • จϕΫτϧͷ୯७ฏۉ͕Α͍ੑೳ • ౷߹ख๏ʹΑΔϕΫτϧͷੑ࣭ੳ ࠓޙͷ՝ SBERT→DefSentͱ Average͕ߴੑೳ DefSent→SBERT ੑೳ͕ѱԽ
·ͱΊɾࠓޙͷ՝ 42 ౷߹ ڭࢣ৴߸ͷҧ͍ʹண͠จϕΫτϧͷ ੑ࣭Λൺֱੳɾ౷߹ ൺֱ • จͷιʔεʹΑΔख๏͝ͱͷੑೳ͕ࠩݦஶ SBERT •
ײۃੑͳͲҙຯతใ͕๛ • දతྨࣅͷӨڹΛड͚ͮΒ͍ DefSent • จ੍࣌ͳͲදతใ͕๛ • ϑϨʔζͷߏಘҙ • දతྨࣅ͕͍จϖΞʹڧ͍ • ౷߹ʹΑͬͯੑೳ্ • SBERT→DefSent Average͕ߴੑೳ • ഁ໓త٫ͷӨڹͰੑ ೳ͕Լ͢Δ߹ ੳର ɹSBERT: ࣗવݴޠਪϕʔε ɹDefSent: ఆٛจ→୯ޠ༧ଌϕʔε ࠓޙͷ՝ 1. ΑΓ൚ͳϞσϧɾจϕΫτϧख๏ͷௐࠪ 2. ౷߹ख๏Ͱߏ͞ΕͨϕΫτϧͷੑ࣭ੳ 3. ΑΓΑ͍౷߹ख๏ͷ։ൃ
·ͱΊɾࠓޙͷ՝ 43 ౷߹ ڭࢣ৴߸ͷҧ͍ʹண͠จϕΫτϧͷ ੑ࣭Λൺֱੳɾ౷߹ ൺֱ • จͷιʔεʹΑΔख๏͝ͱͷੑೳ͕ࠩݦஶ SBERT •
ײۃੑͳͲҙຯతใ͕๛ • දతྨࣅͷӨڹΛड͚ͮΒ͍ DefSent • จ੍࣌ͳͲදతใ͕๛ • ϑϨʔζͷߏಘҙ • දతྨࣅ͕͍จϖΞʹڧ͍ • ౷߹ʹΑͬͯੑೳ্ • SBERT→DefSent Average͕ߴੑೳ • ഁ໓త٫ͷӨڹͰੑ ೳ͕Լ͢Δ߹ ੳର ɹSBERT: ࣗવݴޠਪϕʔε ɹDefSent: ఆٛจ→୯ޠ༧ଌϕʔε ࠓޙͷ՝ 1. ΑΓ൚ͳϞσϧɾจϕΫτϧख๏ͷௐࠪ 2. ౷߹ख๏Ͱߏ͞ΕͨϕΫτϧͷੑ࣭ੳ 3. ΑΓΑ͍౷߹ख๏ͷ։ൃ
ത࢜ޙظ՝ఔͷల
ത࢜ޙظ՝ఔͷల •จϕΫτϧͷকདྷతͳൃలͷͨΊͷॏཁͳ՝ • طଘख๏จͷݶΒΕͨଆ໘ʹ͔͠ண͍ͯ͠ͳ͍ • طଘͷࣄલֶशࡁΈݴޠϞσϧจϕΫτϧͷදݱྗ͕ෆ ɺܭࢉྔେ͖͍ •͋ΒΏΔจϕΫτϧख๏ͷج൫ͱͯ͠༻͍Δ͜ͱ͕Ͱ͖Δɺ൚ ༻ੑʹ༏ΕͨϞσϧ͕ඞཁ จϕΫτϧͷͨΊͷج൫Ϟσϧͷ։ൃ
•ݚڀ1: จϕΫτϧͷੑ࣭ੳ •ݚڀ2: จϕΫτϧʹ͓͚Δج൫ϞσϧͷఏҊ •ݚڀ3: จϕΫτϧʹ͓͚Δج൫ϞσϧͷԠ༻ 45
ത࢜ޙظ՝ఔͷݚڀ 46
ത࢜ޙظ՝ఔͷݚڀ 47
ത࢜ޙظ՝ఔͷݚڀ 48
ݚڀܭը 49 . % % % ੑ࣭ੳɾ ৽نධՁఏҊ ج൫Ϟσϧͷ։ൃ ج൫ϞσϧͷԠ༻
ത࢜จ
ݚڀۀ ࠃจࢽ (ࠪಡ͋Γ) • ௩ӽॣ, ྒྷฏ, ాߒҰ. ఆٛจΛ༻͍ͨจຒΊࠐΈߏ๏, ࣗવݴޠॲཧ Vol.
30 No. 1 (ൃߦ༧ఆ). ࠃࡍձٞ (ࠪಡ͋Γ) • Hayato Tsukagoshi, Ryohei Sasano, Koichi Takeda. Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals, in Proceedings of the 11th Joint Conference on Lexical and Computational Semantics (*SEM 2022). • Hayato Tsukagoshi, Ryohei Sasano, Koichi Takeda. DefSent: Sentence Embeddings using Definition Sentences, in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021). ࠃձٞ (ࠪಡͳ͠) • ཅాᠳฏ, ௩ӽॣ, ྒྷฏ, ాߒҰ. ΨεຒΊࠐΈʹجͮ͘จදݱੜ, ݴޠॲཧֶձ ୈ29ճ࣍େձ (NLP2023) ൃද༧ఆ. • ௩ӽॣ, ฏඌ, Լກ, ࠤࠀݾ, ྒྷฏ, ాߒҰ. ࣗવݴޠਪͱ࠶ݱثΛ༻͍ͨSplit and Rephraseʹ ͓͚Δੜจͷ্࣭, ݴޠॲཧֶձ ୈ28ճ࣍େձ (NLP2022). • ௩ӽॣ, ྒྷฏ, ాߒҰ. ఆٛจΛ༻͍ͨจຒΊࠐΈߏ๏, ݴޠॲཧֶձ ୈ27ճ࣍େձ (NLP2021). ͦͷଞ • 2023 ຊֶज़ৼڵձ ಛผݚڀһ-DC1 ࠾༻ఆ • 2023 ໊ݹେֶ༥߹ϑϩϯςΟΞϑΣϩʔೝఆ 50