Easy NLP with spaCy / NLP with spacy
himkt
June 13, 2018
Slides from the Retrieva seminar held on 2018/06/13.
Transcript
Easy NLP with spaCy
Hiramatsu @ Retrieva Seminar
Image: https://github.com/explosion/spaCy/blob/master/website/assets/img/logo.svg

Tsukuba, M2, NLP himkt
TL;DR
- An introduction to spaCy, a natural language processing library for Python
- Interesting features in v…
- Processing Japanese text with spaCy

About spaCy
Image: https://spacy.io

About spaCy (https://spacy.io)
“Industrial-Strength NLP”
- Non-destructive tokenization
- Named entity recognition
- Support for … languages
- … statistical models for … languages
- …etc.
https://github.com/explosion/spaCy

Basic usage of spaCy

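As a minimal sketch of basic usage: the sample sentence below is made up, and `spacy.blank("en")` is an assumption chosen so that no pretrained model has to be downloaded. The snippet tokenizes a string and checks that tokenization is non-destructive, meaning the original text can be rebuilt from the tokens. A blank pipeline only tokenizes; tagging, parsing and NER would need a trained model.

```python
import spacy

# Assumption: a blank English pipeline (tokenizer only), so no model download is needed.
nlp = spacy.blank("en")

# Made-up example sentence; the double space is deliberate.
text = "spaCy is an NLP library  written in Python."
doc = nlp(text)

# Each token exposes its text and its text plus trailing whitespace.
tokens = [token.text for token in doc]
rebuilt = "".join(token.text_with_ws for token in doc)

print(tokens)
print(rebuilt == text)  # True when the original string is fully recoverable
```

With a trained pipeline loaded instead of the blank one, the same `doc` object would also carry part-of-speech tags, dependency arcs and entities.
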
Many Features
Image: https://spacy.io/usage/facts-figures

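Fastest in the world
- Performance comparison of dependency parsers
- Paper: https://aclweb.org/anthology/P/P…/P…-….pdf
- The paper contains the speed table; were the accuracy figures compiled in-house?
- spaCy v… had not been released at the time of Choi's benchmark, hence n/a
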
Image: https://spacy.io/usage/facts-figures

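Speed comparison with other libraries
- Comparison with other natural language processing libraries
- Not taken from a paper; measured by the spaCy authors (conducted in …)
- Repository: https://github.com/explosion/spacy-benchmarks
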
Image: https://spacy.io/usage/facts-figures

Model comparison
- Some languages have models in several sizes (en, fr, es)
- POS tagger, NER tagger, dependency parser
- Without any pre-processing (dataset)
Image: https://spacy.io/usage/facts-figures

NLTK and spaCy
- NLTK offers some of the same functionality as spaCy
- In comparison to spaCy, NLTK takes a much more "broad church" approach
- spaCy is also much more performance-focussed than NLTK: where the two libraries provide the same functionality, spaCy's implementation will usually be faster and more accurate
Quoted from: https://spacy.io/usage/facts-figures

Other libraries and spaCy
- PyTorch (image: https://github.com/pytorch/pytorch/blob/master/docs/source/_static/img/pytorch-logo-dark.svg)
- AllenNLP (image: https://github.com/allenai/allennlp/blob/master/doc/static/allennlp-logo-dark.png)
- Gensim (image: https://github.com/RaRe-Technologies/gensim/blob/develop/docs/src/readme_images/rare.png)
- CuPy (image: https://github.com/cupy/cupy/blob/master/docs/image/cupy_logo_…px.png)
- in torchtext
- for GPU acceleration
- word vector
- pipeline

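My personal favorite feature of spaCy v…
- displaCy (pronounced "display-cy"?)
- spaCy's visualization module
- Produces clean, good-looking images
- Visualizes the results of named entity recognition and dependency parsing
- Accepts objects parsed by spaCy as-is
- Outputs images in SVG format
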
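A hedged sketch of producing such an SVG with displaCy. To avoid depending on a trained model, it uses displaCy's manual mode (`manual=True`), where the words and arcs are supplied by hand. The sentence, tags and arcs below are made-up illustration data, not parser output; with a real pipeline, a parsed `Doc` would be passed instead.

```python
from spacy import displacy

# Hand-written parse in displaCy's manual input format (illustrative data only).
parse = {
    "words": [
        {"text": "spaCy", "tag": "PROPN"},
        {"text": "parses", "tag": "VERB"},
        {"text": "text", "tag": "NOUN"},
    ],
    "arcs": [
        {"start": 0, "end": 1, "label": "nsubj", "dir": "left"},
        {"start": 1, "end": 2, "label": "dobj", "dir": "right"},
    ],
}

# jupyter=False asks for the markup as a string rather than inline notebook display.
svg = displacy.render(parse, style="dep", manual=True, jupyter=False)

print("<svg" in svg)
```

The returned string can be written to a `.svg` file or embedded in a page. Passing `style="ent"` with an entity-style input renders named entities instead.
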
NER demo on the website
Image: https://explosion.ai/demos/displacy-ent

spaCy x Jupyter Notebook

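spaCy and Japanese
- Japanese support in v… (PR …)
  - Implemented by wrapping Janome, a Japanese morphological analyzer
- Migration of the morphological analyzer in v… (Issue …, PR …)
  - The Japanese Universal Dependency data is segmented with UniDic
  - Janome does not currently support UniDic
  - Migrated to MeCab
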
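The "wrap an external analyzer" approach described above can be sketched as follows. This is not spaCy's actual Japanese implementation: `toy_segment` and `TOY_LEXICON` are hypothetical stand-ins for a real morphological analyzer such as Janome or MeCab, and a blank English pipeline is used only to obtain a vocabulary. The spaCy-specific part is that a tokenizer can be any callable that takes a string and returns a `Doc`.

```python
import spacy
from spacy.tokens import Doc

# Hypothetical toy lexicon standing in for a real analyzer's dictionary.
TOY_LEXICON = {"日本語", "の", "テキスト", "を", "処理", "する"}


def toy_segment(text):
    """Greedy longest-match segmentation; unknown characters become single tokens."""
    words, i = [], 0
    max_len = max(len(w) for w in TOY_LEXICON)
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + size] in TOY_LEXICON or size == 1:
                words.append(text[i:i + size])
                i += size
                break
    return words


class WrappedTokenizer:
    """Adapts an external segmenter to the callable interface spaCy expects."""

    def __init__(self, vocab, segment):
        self.vocab = vocab
        self.segment = segment

    def __call__(self, text):
        words = self.segment(text)
        # Japanese has no spaces between words, so no token has trailing whitespace.
        return Doc(self.vocab, words=words, spaces=[False] * len(words))


nlp = spacy.blank("en")  # assumption: used only as a container for the vocab
nlp.tokenizer = WrappedTokenizer(nlp.vocab, toy_segment)

doc = nlp("日本語のテキストを処理する")
tokens = [t.text for t in doc]
print(tokens)
```

Swapping the analyzer then only means passing a different `segment` function, which is one reason a wrapper design makes a migration from one analyzer to another relatively contained.
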
spaCy and Japanese
spaCy and Japanese: Answer: use UniDic
spaCy and Japanese

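Japanese NER / dependency parsing with spaCy
- Conclusion: not possible at present
- spaCy has a built-in mechanism for training your own tagger and parser
- Japanese Universal Dependencies data is publicly available (https://github.com/UniversalDependencies/UD_Japanese-GSD and others)
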
So couldn't we build our own model!?

Adding Language support
Components of a spaCy language module
Image: https://spacy.io/usage/adding-languages#section-training

Adding Language support
Powered by MeCab
Can spaCy's training module not be used?
Image: https://spacy.io/usage/adding-languages#section-training

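Where does Japanese spaCy currently stand?
- The dictionary is assumed to be UniDic (issue …)
  - (Probably) in most environments MeCab's default dictionary is IPAdic
  - [TODO] JapaneseTokenizer creates a Tagger, but the arguments of that constructor do not seem to be reachable from outside
  - Analyzing a word whose pronunciation is not registered raises an error (IPAdic) (PR …)
  - Is an interface needed that absorbs the output differences between dictionaries?
- Models for NER / dependency parsing do not exist yet
  - Currently `Alpha tokenization support`
  - How should languages that require word segmentation be supported…
  - Chinese appears to be stalled in a similar state (PR …)
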
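One way to picture the interface question raised above is a small normalizer that turns each dictionary's comma-separated feature string into a common record. The sketch below is plain Python and does not call MeCab. The field layout assumed for IPAdic (part of speech first, base form, reading and pronunciation as the 7th to 9th fields, with the last two absent for unknown words) and the sample strings are assumptions for illustration, and `Morpheme` and `parse_ipadic` are hypothetical names. A UniDic parser with its own column layout would return the same record type.

```python
from typing import NamedTuple, Optional


class Morpheme(NamedTuple):
    surface: str
    pos: str
    lemma: str
    pronunciation: Optional[str]  # None when the dictionary has no entry


def parse_ipadic(surface: str, feature: str) -> Morpheme:
    """Normalize one IPAdic-style feature string (assumed layout, see lead-in)."""
    fields = feature.split(",")
    pos = fields[0]
    # Base form is assumed to be the 7th field; "*" marks a missing value.
    lemma = fields[6] if len(fields) > 6 and fields[6] != "*" else surface
    # Unknown words may carry only 7 fields, so guard the index instead of assuming 9.
    pron = fields[8] if len(fields) > 8 and fields[8] != "*" else None
    return Morpheme(surface, pos, lemma, pron)


# Made-up sample feature strings: a dictionary word and an unknown word.
known = parse_ipadic("日本語", "名詞,一般,*,*,*,*,日本語,ニホンゴ,ニホンゴ")
unknown = parse_ipadic("spaCy", "名詞,固有名詞,組織,*,*,*,*")

print(known)
print(unknown)
```

Guarding the index lookup is the point of the sketch: code that always reads a pronunciation field fails on words for which the dictionary supplies none, whereas a normalized record makes the missing value explicit.
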
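With the following in place, testing should be possible for the time being
- Have travis install MeCab
- Register UniDic in spacy-models
- Make python -m spacy download ja available
- NER and dependency parsing: to be investigated later…
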
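Summary
- Introduced spaCy, a natural language processing library
  - A natural language processing library that came out of industry
- The visualization module added in v… is nice
  - Good-looking visualizations using displaCy
- Japanese text processing with spaCy is still incomplete
  - For now it is an interface to morphological analysis
pip install spacy (Run this command)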