Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
How to start studying NLP 02
Search
kabayan55
February 18, 2019
Programming
7
5.3k
How to start studying NLP 02
kabayan55
February 18, 2019
Tweet
Share
More Decks by kabayan55
See All by kabayan55
My favorite tool 2019
kabayan55
2
1.7k
Escalators are Awesome
kabayan55
2
1.5k
How to start studying NLP
kabayan55
0
360
Other Decks in Programming
See All in Programming
Docコメントで始める簡単ガードレール
keisukeikeda
1
110
The Past, Present, and Future of Enterprise Java
ivargrimstad
0
430
RAGでハマりがちな"Excelの罠"を、データの構造化で突破する
harumiweb
9
2.8k
Codexに役割を持たせる 他のAIエージェントと組み合わせる実務Tips
o8n
4
1.3k
AI主導でFastAPIのWebサービスを作るときに 人間が構造化すべき境界線
okajun35
0
700
grapheme_strrev関数が採択されました(あと雑感)
youkidearitai
PRO
1
210
コードレビューをしない選択 #でぃーぷらすトウキョウ
kajitack
3
890
The Past, Present, and Future of Enterprise Java
ivargrimstad
0
190
守る「だけ」の優しいEMを抜けて、 事業とチームを両方見る視点を身につけた話
maroon8021
3
750
Claude Code Skill入門
mayahoney
0
220
社内規程RAGの精度を73.3% → 100%に改善した話
oharu121
13
7.9k
TipKitTips
ktcryomm
0
160
Featured
See All Featured
How to Ace a Technical Interview
jacobian
281
24k
SEOcharity - Dark patterns in SEO and UX: How to avoid them and build a more ethical web
sarafernandez
0
140
Making the Leap to Tech Lead
cromwellryan
135
9.8k
Bioeconomy Workshop: Dr. Julius Ecuru, Opportunities for a Bioeconomy in West Africa
akademiya2063
PRO
1
69
Measuring Dark Social's Impact On Conversion and Attribution
stephenakadiri
1
150
Making Projects Easy
brettharned
120
6.6k
AI Search: Where Are We & What Can We Do About It?
aleyda
0
7.1k
Visualization
eitanlees
150
17k
The Organizational Zoo: Understanding Human Behavior Agility Through Metaphoric Constructive Conversations (based on the works of Arthur Shelley, Ph.D)
kimpetersen
PRO
0
270
Bridging the Design Gap: How Collaborative Modelling removes blockers to flow between stakeholders and teams @FastFlow conf
baasie
0
470
Accessibility Awareness
sabderemane
0
77
The SEO identity crisis: Don't let AI make you average
varn
0
410
Transcript
ʲॳ৺ऀ͚ʳ ɹ͡ΊͯΈΑ͏ʂࣗવݴޠॲཧ ɹɹࣗવݴޠॲཧͷੈքɺΑ͏ͦ͜ αϙʔλʔζ$P-BCษڧձ ݄ LBCBZBO
LBCBZBO େֶɾେֶӃͷݚڀͰࣗવݴޠॲཧ 8FCܥاۀ৽ଔ σʔλαΠΤϯεΤϯδχΞ ࣗݾհ
Agenda ࣗવݴޠॲཧͰͰ͖Δ͜ͱ ࣗવݴޠॲཧͷษڧ๏
Agenda ࣗવݴޠॲཧͰͰ͖Δ͜ͱ ࣗવݴޠॲཧͷษڧ๏
ࣗવݴޠΛίϯϐϡʔλͰॲཧ͢Δ ࣗવݴޠɿਓ͕ؒৗతʹͬͯΔݴޠ ɹɹɹɹɹྫ ຊޠɺӳޠ ੜ·Εͨͱ͖͔Βۙʹ͋ΔࣗવݴޠΛ ίϯϐϡʔλͰॲཧͰ͖Δͬͯ ͳΜ͔ͩͦ͢͝͏ʂ ʜʜͱ࠷ॳࢲࢥ͍·ͨ͠ ࣗવݴޠॲཧͬͯͳʹʁ
֓ཁਤ ⽂書分類 ⾃動要約 情報抽出 機械翻訳 質問応答 情報検索 評判分析 形態素解析 構⽂解析
意味解析 要素技術 複合技術 etc.
ܗଶૉղੳ ܗଶૉʢ୯ޠʣʹ͚ͯࢺผ .F$BC +6."/ͳͲ $ mecab すもももももももものうち すもも 名詞,⼀般,*,*,*,*,すもも,スモモ,スモモ も 助詞,係助詞,*,*,*,*,も,モ,モ もも 名詞,⼀般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ もも 名詞,⼀般,*,*,*,*,もも,モモ,モモ の 助詞,連体化,*,*,*,*,の,ノ,ノ うち名詞,⾮⾃⽴,副詞可能,*,*,*,うち,ウチ,ウチ EOS ཁૉٕज़
ߏจղੳ ,/1 $BCP$IB ͳͲ ཁૉٕज़ Wikipedia より
ҙຯղੳ ߏจతᐆດੑ͕͋Δͱ͖ ҙຯղੳ͕ඞཁ ྫ ʮ಄͕͍ڕΛ৯Δೣʯ தଜ໌༟͞Μ !OLNS@BLJ ͷ5XJUUFSΑΓ ཁૉٕज़
จॻྨ จॻΛΧςΰϦ͝ͱʹ͚Δ ࣗಈཁ จষΛࣗಈͰཁ͢Δ ใநग़ ΩʔϫʔυΛநग़͢Δ ྫʣΠϕϯτใநग़ɺใநग़ ෳ߹ٕज़
ෳ߹ٕज़ ධੳ ྫ ϨϏϡʔจ Positive Negative ͜ͷέʔΩ͍ͪ͝ͷ ͕͞ࡍཱͬͯඒຯͰͨ͠ɻ ·ͨߪೖ͍ͨ͠Ͱ͢ɻ ΫϦʔϜ͕͗ͨ͢ɻ
εϙϯδ͕ύαύαͩͬͨɻ
ෳ߹ٕज़ ػց༁ ใݕࡧ ࣭Ԡ
୯ޠΛϕΫτϧͰදݱͰ͖Δ ୯ޠͷ͠ࢉҾ͖ࢉ͕Ͱ͖Δ ྫ LJOHrNBO XPNBORVFFO ୯ޠͷྨࣅ͕Θ͔Δ χϡʔϥϧωοτϫʔΫ ٕज़հ8PSE7FD King Queen
Woman Man
8PSE7FDͱͷҧ͍ɿ׆༻ܗΛ·ͱΊΒΕΔ ྫ HP HPJOH HPFTˠHP ٕज़հGBTU5FYU
݄ʹ(PPHMF͕ެ։ ൚༻తͳϞσϧ ϑΝΠϯνϡʔχϯάͰߴ͍ਫ਼Λग़͢ ٕज़հ#&35
Agenda ࣗવݴޠॲཧͰͰ͖Δ͜ͱ ࣗવݴޠॲཧͷษڧ๏
ࢲPythonΛ༻͍ͯ͠·͢ Python͕ਓؾʂ ϝϦοτ ! εΫϦϓτݴޠͳͷͰ͙͢ʹ࣮ߦͰ͖Δ ! ๛ͳϥΠϒϥϦ ɹ/VNQZ 4DJQZ /-5, 4DJLJUMFBSO ϓϩάϥϛϯάݴޠʁ
͓͢͢Ίڭࡐ
ݴޠॲཧຊϊοΫ http://www.cl.ecei.tohoku.ac.jp/nlp100/
ݴޠॲཧຊϊοΫ ! ౦େͷԬ࡚ઌੜ͕࡞ͨ͠ νϡʔτϦΞϧ ! Pythonͷ࿅शʹͳΔ ! ݴޠॲཧʹඞཁͳ࣮͜͜ͰֶΔ ! GitHubʹίʔυΛ্͛ͯΔͻͱଟ͘ɺ ଞͷਓͷίʔυΛࢀߟʹͰ͖ΔͷͰ ಠֶ͍͢͠
ݴޠॲཧຊϊοΫ
ݴޠॲཧຊϊοΫ GitHubͰ “NLP100knock” ͱ ݕࡧ͢Δ͚ͩͰɺ 86 ϦϙδτϦ ݟ͔ͭΔ ˞20189݄࣌
ར༻ऀͨ͘͞Μ ͍·͢
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ http://phontron.com/teaching.php
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ http://phontron.com/teaching.php
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ ! ΧʔωΪʔϝϩϯେֶͷ Graham Neubig ઌੜ͕࡞ͨ͠ νϡʔτϦΞϧ ! εϥΠυܗࣜ ! ֤νϡʔτϦΞϧʹԋश͕͋Γɺ ٖࣅίʔυͱߨٛεϥΠυΛࢀߟʹ ࣮͢Δͱཧղ͕ਂ·Δ
! ࣜΑΓίʔυΛݟͨ΄͏͕ ཧղ͍͢͠ਓʹಛʹΦεεϝ
/-1ϓϩάϥϛϯάνϡʔτϦΞϧ ࢿྉɾԋशσʔλ ͔͜͜Β Ұׅμϯϩʔυʂ https://github.com/neubig/nlptutorial
ࣗવݴޠॲཧΛಠश͍ͨ͠ਓͷͨΊʹ http://cl.sd.tmu.ac.jp/prospective/prerequisite
ࣗવݴޠॲཧΛಠश͍ͨ͠ਓͷͨΊʹ टେֶ౦ژͷখொઌੜ͕ ! ֶ ! ӳޠ ! ϓϩάϥϛϯά ! ػցֶश ! ࣗવݴޠॲཧ ͷษڧͷํʹ͍ͭͯ ·ͱΊ͍ͯΔϖʔδ
ࣗવݴޠॲཧΛಠश͍ͨ͠ਓͷͨΊʹ ࠓճॳ৺ऀ͚ͷߨٛͳͷͰ հ͚ͩʹͱͲΊ͓͖ͯ·͕͢ Կͷษڧ͕ඞཁͰ Ͳ͏ษڧ͖͔͢ ஸೡʹΘ͔Γ͘͢·ͱ·͍ͬͯΔͷͰ ੋඇ͝ཡʹͳͬͯ΄͍͠Ͱ͢ʂ
⻑岡技術科学⼤学⾃然⾔語処理研究室(YouTube) IUUQTXXXZPVUVCFDPNVTFSKOMQPSH ʮษڧձʯ͔ΒݟΔͱྑ͍ͱࢥ͍·͢
LBHHMF ࣗવݴޠॲཧܥͷίϯϖ͋Δ Θͨ͠/-1ͷίϯϖग़ͨ͜ͱͳ͍Ͱ͢
ࣗવݴޠॲཧΤϯδχΞʹͳΓ͍ͨਓ ! ػցֶशΤϯδχΞʹͳͬͯ ࣗવݴޠॲཧΔ ! ࣗવݴޠॲཧٕज़ʹಛԽͨ͠اۀʹߦ͘
ػցֶशΤϯδχΞʹͳΓ͍ͨਓ Φεεϝॻ੶ ʰػցֶशΤϯδχΞʹͳΓ͍ͨਓͷ ɹͨΊͷຊ"*Λఱ৬ʹ͢Δʱ ! ԿΛ͢Ε͍͍͔۩ମత
&OKPZ 4UVEZJOH /-1