20180414_WSDM2018_reading_YoheiKIKUTA
yoppe
April 12, 2018
Science
HP:
https://atnd.org/events/95510
Transcript
Why People Search for Images using Web Search Engines
WSDM 2018 reading group, 2018-04-14
Yohei Kikuta (@yohei_kikuta)
Event URL: https://atnd.org/events/95510
Paper: https://arxiv.org/abs/1711.09559
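
Summary
1. Can the intent behind text-based web image search be classified? → YES. Three classes: Entertain, Explore/Learn, Locate/Acquire.
2. Can the intents be distinguished using obtainable features? → YES. Dwell time and mouse gestures.
3. Can the search intent be predicted early in a session? → MAYBE. A model built on those features reaches a certain level of performance.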

Background
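
Wanting to know the search intent
Understanding the intent behind a user's search behavior is important
→ improves user satisfaction (suggestion, recommendation, ...)
Web search has been studied, but work on image search is limited
→ query-based approaches
→ however, image search queries tend to be short, so the uncertainty is large
This paper studies image search intent using session information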
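
Research questions in this paper
1. Can the intent behind text-based web image search be classified?
2. Can the intents be distinguished using obtainable features?
3. Can the search intent be predicted early in a session?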

Prior work

Taxonomy of intent in web search
"A taxonomy of web search" (2002) classifies intent into three types:
1. Navigational: reaching a specific site
2. Informational: obtaining information
3. Transactional: performing a web-mediated activity
Ref: https://dl.acm.org/citation.cfm?id=792552

Taxonomy of intent in web search
"Task Behaviors During Web Search: The Difficulty of Assigning Labels" (2009) classifies search tasks into seven types:
» Navigate, Find-Simple, Find-Complex, Locate/Acquire, Explore/Learn, Play, Meta
The paper discussed here largely follows this prior work and extends it to image search.
Ref: http://ieeexplore.ieee.org/document/4755491/

Classifying image search intent
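
Approach
Collect data, then have three web researchers classify it
» User survey data
  » Collected information such as gender
  » Details of a recent search (motivation, etc.) and the queries used
  » 211 respondents who gave adequate answers were included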
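
Approach
Collect data, then have three web researchers classify it
» Log data
  » Log data from https://www.sogou.com/
  » 475 sessions in which consecutive queries were issued within 30 minutes (adult content excluded)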
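
As a rough illustration of how a query log might be cut into sessions, the sketch below starts a new session whenever the gap between consecutive queries exceeds 30 minutes. The threshold interpretation, the `split_sessions` name, and the `(timestamp, query)` record layout are assumptions made for illustration; the slides do not describe the paper's actual procedure in this detail.

```python
from datetime import datetime, timedelta

# Assumption: a gap longer than 30 minutes between queries ends a session.
SESSION_GAP = timedelta(minutes=30)


def split_sessions(events, gap=SESSION_GAP):
    """Group one user's (timestamp, query) pairs into sessions.

    A new session starts whenever the time since the previous query
    exceeds `gap`.
    """
    sessions = []
    for ts, query in sorted(events):
        if sessions and ts - sessions[-1][-1][0] <= gap:
            sessions[-1].append((ts, query))  # continue the current session
        else:
            sessions.append([(ts, query)])    # start a new session
    return sessions


if __name__ == "__main__":
    log = [
        (datetime(2018, 4, 14, 10, 0), "gorilla"),
        (datetime(2018, 4, 14, 10, 10), "gorilla bonobo difference"),
        (datetime(2018, 4, 14, 11, 0), "blue sky wallpaper"),
    ]
    for session in split_sessions(log):
        print([query for _, query in session])
```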

Approach
Collect data, then have three web researchers classify it
» Log data (length is the number of queries)
Ref: https://arxiv.org/abs/1711.09559

Decision criteria created
1. Does the user's search behavior stem from a clear goal?
2. Does the user need to download the image for later use?
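
Three search intents (answers to criteria 1 and 2 in parentheses)
1. Explore/Learn (1-yes, 2-no)
   e.g., checking how gorillas and bonobos differ in appearance
2. Locate/Acquire (1-yes, 2-yes)
   e.g., finding and downloading a gorilla image to use in a report
3. Entertain (1-no, 2-yes or no)
   e.g., browsing funny gorilla pictures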
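
The two yes/no criteria map onto the three intents as a small decision rule. The sketch below encodes that mapping exactly as listed on the slide; the function and parameter names are hypothetical.

```python
def classify_intent(has_clear_goal: bool, needs_download: bool) -> str:
    """Map the answers to the two criteria onto one of the three intents.

    has_clear_goal: answer to criterion 1 (search driven by a clear goal?)
    needs_download: answer to criterion 2 (image must be downloaded for later use?)
    """
    if not has_clear_goal:
        # Entertain is "1-no, 2-yes or no": criterion 2 does not matter here.
        return "Entertain"
    return "Locate/Acquire" if needs_download else "Explore/Learn"


if __name__ == "__main__":
    print(classify_intent(True, False))   # comparing gorillas and bonobos
    print(classify_intent(True, True))    # gorilla image for a report
    print(classify_intent(False, False))  # browsing funny gorilla pictures
```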

Three search intents (examples)
Ref: https://arxiv.org/abs/1711.09559
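
Validity check (intent classification by three graduate students)
» User survey data
  Fleiss' kappa: 0.673
  Explore/Learn: 27%, Locate/Acquire: 66%, Entertain: 7%
» Log data (queries only)
  Fleiss' kappa: 0.375
  Explore/Learn: 56%, Locate/Acquire: 39%, Entertain: 5%
The classes look separable, but grasping the intent from queries alone is difficult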
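
For readers unfamiliar with the agreement measure quoted above, here is a minimal sketch of Fleiss' kappa using its standard definition. The toy rating table is made up for illustration and is not data from the paper; the column order (Explore/Learn, Locate/Acquire, Entertain) is likewise an assumption.

```python
def fleiss_kappa(counts):
    """Compute Fleiss' kappa.

    counts[i][j] is the number of raters who assigned item i to category j.
    Every item must be rated by the same number of raters (at least two).
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each item needs the same number (>= 2) of ratings")

    # Observed agreement: fraction of agreeing rater pairs, averaged over items.
    p_items = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_items) / n_items

    # Chance agreement from the overall category proportions.
    n_categories = len(counts[0])
    p_cat = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    p_e = sum(p * p for p in p_cat)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when all ratings share one category")

    return (p_bar - p_e) / (1 - p_e)


if __name__ == "__main__":
    # Made-up example: 4 sessions, 3 annotators.
    # Columns: Explore/Learn, Locate/Acquire, Entertain.
    toy = [
        [3, 0, 0],
        [1, 2, 0],
        [0, 3, 0],
        [0, 1, 2],
    ]
    print(round(fleiss_kappa(toy), 3))
```

Values near 1 indicate strong agreement beyond chance, which is why the drop from 0.673 (survey data) to 0.375 (queries only) suggests that queries alone carry less information about intent.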

Distinguishing intents with obtainable features
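
12 image search tasks performed by 35 undergraduate students
e.g., change the PC background to an image of blue sky and [...] (Locate/Acquire)
The following features were collected during the tasks
Ref: https://arxiv.org/abs/1711.09559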
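
Significant differences appear across intents, so they can be distinguished
(E/L: Explore/Learn, L/A: Locate/Acquire, E: Entertain)
Dwell time is largest for E/L; mouse clicks follow E/L < L/A < E; and so on
(see the paper for details)
Ref: https://arxiv.org/abs/1711.09559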

Predicting intent early in the session
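
Setup
"Early in the session" means "up to the first mouse scroll"
Notes on the features used for prediction:
- Clicks and the time of the first mouse-over are not used
- Query reformulation is not used, because prediction is meant to work per query
Evaluated on the image search tasks completed by the undergraduates, using GBDT with 10-fold CV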
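
To make the "up to the first mouse scroll" cut-off concrete, the sketch below truncates an interaction log at the first scroll event and counts the remaining event types, skipping clicks as the slide specifies. The event names (`"move"`, `"hover"`, `"click"`, `"scroll"`), the tuple layout, and both function names are hypothetical; the paper's real feature set is not reproduced here.

```python
# Assumption: event kinds are plain strings; clicks are excluded per the slide.
EXCLUDED_KINDS = {"click"}


def early_session_events(events):
    """Return the events that occur before the first mouse scroll.

    `events` is a time-ordered list of (seconds_since_query, kind) tuples.
    """
    early = []
    for t, kind in events:
        if kind == "scroll":
            break  # everything from the first scroll onward is ignored
        early.append((t, kind))
    return early


def early_feature_counts(events):
    """Count early-session events per kind, skipping excluded kinds."""
    counts = {}
    for _, kind in early_session_events(events):
        if kind in EXCLUDED_KINDS:
            continue
        counts[kind] = counts.get(kind, 0) + 1
    return counts


if __name__ == "__main__":
    log = [(0.5, "move"), (1.0, "hover"), (1.2, "click"),
           (2.0, "scroll"), (3.0, "hover")]
    print(early_feature_counts(log))
```

Counts like these would then be one possible input to a classifier such as the GBDT mentioned on the slide.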

Prediction performance is not high, but the task does not look impossible
The baseline assigns every instance to the majority class
Ref: https://arxiv.org/abs/1711.09559
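
A majority-class baseline under k-fold cross-validation can be sketched in a few lines. This is an illustrative reconstruction of the kind of baseline described, not the paper's code: the interleaved fold assignment, the function name, and the toy label counts are assumptions.

```python
from collections import Counter


def majority_baseline_cv(labels, k=10):
    """Mean accuracy of a majority-class predictor under k-fold CV.

    Folds are formed by index modulo k, so `labels` is assumed to be
    in an order where that split is reasonable (e.g., already shuffled).
    """
    if len(labels) < k:
        raise ValueError("need at least k labels")
    fold_accuracies = []
    for fold in range(k):
        train = [y for i, y in enumerate(labels) if i % k != fold]
        test = [y for i, y in enumerate(labels) if i % k == fold]
        # Predict the most frequent training label for every test instance.
        majority = Counter(train).most_common(1)[0][0]
        fold_accuracies.append(sum(y == majority for y in test) / len(test))
    return sum(fold_accuracies) / k


if __name__ == "__main__":
    # Made-up label distribution, not the paper's data.
    toy_labels = ["L/A"] * 60 + ["E/L"] * 30 + ["E"] * 10
    print(majority_baseline_cv(toy_labels))
```

Any learned model, such as the GBDT from the setup slide, needs to beat this number to show that the early-session features carry signal.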

Summary and impressions
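
Summary (repeated)
1. Can the intent behind text-based web image search be classified? → YES. Three classes: Entertain, Explore/Learn, Locate/Acquire.
2. Can the intents be distinguished using obtainable features? → YES. Dwell time and mouse gestures.
3. Can the search intent be predicted early in a session? → MAYBE. A model built on those features reaches a certain level of performance.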
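
Impressions
» Classifying with simple criteria is a good approach
» Not especially novel, but each point is investigated thoroughly
» Applying this to a real service seems to require one more leap (prediction performance, etc.)
» It was refreshing to read, for the first time in a while, a paper that uses no equations at all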