Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
20180414_WSDM2018_reading_YoheiKIKUTA
Search
yoppe
April 12, 2018
Science
0
720
20180414_WSDM2018_reading_YoheiKIKUTA
HP:
https://atnd.org/events/95510
yoppe
April 12, 2018
Tweet
Share
More Decks by yoppe
See All by yoppe
20211023_recsys2021_paper_reading_YoheiKikuta
diracdiego
2
490
20201121_oldpaperreading_computing_machinery_and_intelligence
diracdiego
0
170
20200906_ACL2020_metric_for_ordinal_classification_YoheiKikuta
diracdiego
1
1.3k
20191102_ACL2019_adversarial_examples_in_NLP_YoheiKIKUTA
diracdiego
2
1.4k
20190223_nlpaperchallenge_CV_4.3to5.5
diracdiego
2
830
20180701_CVPR2018_reading_YoheiKIKUTA
diracdiego
3
1.2k
20180306_NIPS2017_DeepLearning
diracdiego
4
5.9k
20180215_MLKitchen7_YoheiKIKUTA
diracdiego
0
440
20180210_Cookpad_TechConf2018_YoheiKIKUTA
diracdiego
5
1.2k
Other Decks in Science
See All in Science
白金鉱業Meetup_Vol.20 効果検証ことはじめ / Introduction to Impact Evaluation
brainpadpr
1
840
LayerXにおける業務の完全自動運転化に向けたAI技術活用事例 / layerx-ai-jsai2025
shimacos
2
1.6k
システム数理と応用分野の未来を切り拓くロードマップ・エンターテインメント(スポーツ)への応用 / Applied mathematics for sports entertainment
konakalab
1
400
テンソル分解による糖尿病の組織特異的遺伝子発現の統合解析を用いた関連疾患の予測
tagtag
2
250
🌏地球から🌌宇宙まで! 〜ケプラーの法則で繋がる天体の運動〜
syotasasaki593876
1
110
Quelles valorisations des logiciels vers le monde socio-économique dans un contexte de Science Ouverte ?
bluehats
1
530
データベース15: ビッグデータ時代のデータベース
trycycle
PRO
0
360
データベース03: 関係データモデル
trycycle
PRO
1
270
データマイニング - ノードの中心性
trycycle
PRO
0
270
深層学習を用いた根菜類の個数カウントによる収量推定法の開発
kentaitakura
0
180
ランサムウェア対策にも考慮したVMware、Hyper-V、Azure、AWS間のリアルタイムレプリケーション「Zerto」を徹底解説
climbteam
0
120
研究って何だっけ / What is Research?
ks91
PRO
1
130
Featured
See All Featured
RailsConf 2023
tenderlove
30
1.2k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
140
34k
Automating Front-end Workflow
addyosmani
1371
200k
Speed Design
sergeychernyshev
32
1.1k
Reflections from 52 weeks, 52 projects
jeffersonlam
352
21k
Why You Should Never Use an ORM
jnunemaker
PRO
59
9.5k
Become a Pro
speakerdeck
PRO
29
5.5k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
252
21k
Rebuilding a faster, lazier Slack
samanthasiow
84
9.2k
YesSQL, Process and Tooling at Scale
rocio
173
14k
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
9
960
How GitHub (no longer) Works
holman
315
140k
Transcript
Why People Search for Images using Web Search Engines WSDM
2018 จಡΈձ 20180414 ٠ా ངฏ (@yohei_kikuta) Event URL: https://atnd.org/events/95510, paper: https://arxiv.org/abs/1711.09559
·ͱΊ 1. text base ͷΣϒը૾ݕࡧͷҙਤྨՄೳ͔ʁ → YES. 3ͭʹྨ: Entertain, Explore/Learn,
Locate/Acquire 2. औಘՄೳͳಛྔ͔ΒҙਤΛผͰ͖Δ͔ʁ → YES. ཹ࣌ؒϚεδΣενϟ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ → MAYBE. ಛྔΛͬͯϞσϧΛ࡞ͯ͠Ұఆͷੑೳ 2
എܠ 3
ݕࡧͷҙਤΛΓ͍ͨ Ϣʔβͷݕࡧߦಈͷཪʹ͋ΔҙਤΛΔ͜ͱॏཁ → Ϣʔβͷຬ্ʢsuggestion, recommendation, ...ʣ Σϒݕࡧͷݚڀͳ͞Ε͖͕ͯͨɺը૾ݕࡧʹؔͯ͠ݶఆత → ΫΤϦϕʔε →
͔͠͠ը૾ݕࡧͷΫΤϦ͘ͳΓ͕ͪͰෆ࣮֬ੑ͕େ͖͍ ຊจͰηογϣϯใΛѻͬͯը૾ݕࡧͷҙਤΛݚڀ 4
ຊจʹ͓͚ΔϦαʔνΫΤενϣϯ 1. text base ͷΣϒը૾ݕࡧͷҙਤྨՄೳ͔ʁ 2. औಘՄೳͳಛྔ͔ΒҙਤΛผͰ͖Δ͔ʁ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ 5
ઌߦݚڀ 6
Σϒݕࡧʹ͓͚Δҙਤͷ taxonomy A taxonomy of web search (2002) ͰҙਤΛ3ͭʹྨ 1.
Navigational: ಛఆͷαΠτ౸ୡ 2. Informational: ใͷऔಘ 3. Transactional: ΣϒΛഔհͱͨ͠׆ಈ Ref: https://dl.acm.org/citation.cfm?id=792552 7
Σϒݕࡧʹ͓͚Δҙਤͷ taxonomy Task Behaviors During Web Search: The Difficulty of
Assigning Labels (2009) ͰݕࡧλεΫΛ7ͭʹྨ » Navigate, Find-Simple, Find-Complex, Locate/Acquire, Explore/Learn, Play, Meta ຊจ͜ͷઌߦݚڀΛ౿ऻͭͭ͠ը૾ݕࡧʹൃలͤͨ͞ͷɺͱ͍͏ ৭߹͍͕ڧ͍ Ref: http://ieeexplore.ieee.org/document/4755491/ 8
ը૾ݕࡧͷҙਤΛྨ 9
Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷΣϒݚڀऀ͕ྨ » ϢʔβͷΞϯέʔτσʔλ » ੑผใͳͲΛऔಘ » ࠷ۙͷݕࡧʹؔ͢ΔৄࡉʢಈػͳͲʣɺ༻ͨ͠ΫΤϦ » దͳճΛͨ͠211ਓ͕ର
10
Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷΣϒݚڀऀ͕ྨ » ϩάσʔλ » https://www.sogou.com/ ͷϩάσʔλ » 30Ҏʹ࿈ଓతͳΫΤϦΛ༩͍͑ͯΔ475ηογϣϯʢআ͘Ξμ ϧτʣ
11
Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷΣϒݚڀऀ͕ྨ » ϩάσʔλʢlength ΫΤϦʣ Ref: https://arxiv.org/abs/1711.09559 12
࡞ͨ͠அج४ 1. Ϣʔβͷݕࡧߦಈ໌֬ͳతʹґΔͷ͔ʁ 2. ޙͷར༻ͷͨΊʹը૾Λμϯϩʔυ͢Δඞཁ͕͋Δ͔ʁ 13
3ͭͷݕࡧҙਤ 1. Explore/Learn (1-yes, 2-no) ྫʣΰϦϥͱϘϊϘͷݟͨͷҧ͍ΛνΣοΫ 2. Locate/Acquire (1-yes, 2-yes)
ྫʣϨϙʔτ࡞Ͱ͏ΰϦϥͷը૾Λ୳ͯ͠μϯϩʔυ 3. Entertain (1-no, 2-yes or no) ྫʣΰϦϥͷ໘നը૾ΛோΊΔ 14
3ͭͷݕࡧҙਤʢྫʣ Ref: https://arxiv.org/abs/1711.09559 15
ଥੑͷݕূʢ3ਓͷେֶӃੜʹΑΔҙਤྨʣ » ϢʔβͷΞϯέʔτσʔλ Fleiss' kappa: 0.673 Explore/Learn: 27%, Locate/Acquire: 66%,
Entertain: 7% » ϩάσʔλʢΫΤϦͷΈΛ༻ʣ Fleiss' kappa: 0.375 Explore/Learn: 56%, Locate/Acquire: 39%, Entertain: 5% ͏·͚͘Εͦ͏͕ͩΫΤϦͷΈͰҙਤΛΉͷ͍͠ 16
औಘՄೳͳಛྔͰҙਤΛผ 17
35ਓͷֶ෦ੜʹΑΔ12ݸͷը૾ݕࡧλεΫ ྫʣPCͷഎܠΛ੨ۭͱͷը૾ʹมߋʢLocate/Acquireʣ ͦͷࡍʹҎԼͷಛྔΛऔಘ Ref: https://arxiv.org/abs/1711.09559 18
ҙਤʹΑͬͯ༗ҙͳ͕ࠩग़ΔͷͰผՄೳ ఀཹ࣌ؒ E/L ͕ଟ͍ɺϚεΫϦοΫ E/L < L/A < EɺͳͲ ʢৄࡉจΛࢀরʣ
Ref: https://arxiv.org/abs/1711.09559 19
ηογϣϯॳظͰͷҙਤͷ༧ଌ 20
ઃఆ ηογϣϯॳظͱʮ࠷ॳͷϚεεΫϩʔϧ͕͋Δ·Ͱʯ ༧ଌͰ͏ feature ͱͯ͠ҎԼͷҙ - ΫϦοΫͱ࠷ॳͷϚεΦʔόʔ࣌ؒΘͳ͍ - ΫΤϦϕʔεͰΓ͍ͨͷͰ query
reformulation Θͳ͍ ֶ෦ੜʹղ͔ͤͨը૾ݕࡧλεΫʹରͯ͠ GBDT Ͱ 10-fold CV 21
༧ଌੑೳߴ͘ͳ͍͕ෆՄೳͰͳͦ͞͏ Baseline majority ʹશ෦دͤΔͱ͍͏ͷ Ref: https://arxiv.org/abs/1711.09559 22
·ͱΊͱॴײ 23
·ͱΊʢ࠶ܝʣ 1. text base ͷΣϒը૾ݕࡧͷҙਤྨՄೳ͔ʁ → YES. 3ͭʹྨ: Entertain, Explore/Learn,
Locate/Acquire 2. औಘՄೳͳಛྔ͔ΒҙਤΛผͰ͖Δ͔ʁ → YES. ཹ࣌ؒϚεδΣενϟ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ → MAYBE. ಛྔΛͬͯϞσϧΛ࡞ͯ͠Ұఆͷੑೳ 24
ॴײ » γϯϓϧͳج४ͰྨΛ͍ͯ͠Δͱ͍͏ͷྑ͍ » ৽ͱ͍͏Θ͚Ͱͳ͍͕ҰͭҰ͔ͭͬ͠Γௐ͍ͯΔ » ࣮αʔϏεͷԠ༻ʹҰาඈ༂͕ඞཁͦ͏ʢ༧ଌੑೳͳͲʣ » ٱʑʹࣜΛશવΘͳ͍จΛಡΜͰ৽ͩͬͨ 25