Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
20180414_WSDM2018_reading_YoheiKIKUTA
Search
yoppe
April 12, 2018
Science
0
700
20180414_WSDM2018_reading_YoheiKIKUTA
HP:
https://atnd.org/events/95510
yoppe
April 12, 2018
Tweet
Share
More Decks by yoppe
See All by yoppe
20211023_recsys2021_paper_reading_YoheiKikuta
diracdiego
2
470
20201121_oldpaperreading_computing_machinery_and_intelligence
diracdiego
0
150
20200906_ACL2020_metric_for_ordinal_classification_YoheiKikuta
diracdiego
1
1.3k
20191102_ACL2019_adversarial_examples_in_NLP_YoheiKIKUTA
diracdiego
2
1.4k
20190223_nlpaperchallenge_CV_4.3to5.5
diracdiego
2
790
20180701_CVPR2018_reading_YoheiKIKUTA
diracdiego
3
1.2k
20180306_NIPS2017_DeepLearning
diracdiego
4
5.9k
20180215_MLKitchen7_YoheiKIKUTA
diracdiego
0
410
20180210_Cookpad_TechConf2018_YoheiKIKUTA
diracdiego
5
1.2k
Other Decks in Science
See All in Science
WCS-LA-2024
lcolladotor
0
190
科学で迫る勝敗の法則(名城大学公開講座.2024年10月) / The principle of victory discovered by science (Open lecture in Meijo Univ. 2024)
konakalab
0
280
02_西村訓弘_プログラムディレクター_人口減少を機にひらく未来社会.pdf
sip3ristex
0
190
01_篠原弘道_SIPガバニングボード座長_ポスコロSIPへの期待.pdf
sip3ristex
0
200
Online Feedback Optimization
floriandoerfler
0
940
はじめてのバックドア基準:あるいは、重回帰分析の偏回帰係数を因果効果の推定値として解釈してよいのか問題
takehikoihayashi
2
1.4k
Coqで選択公理を形式化してみた
soukouki
0
300
Cross-Media Information Spaces and Architectures (CISA)
signer
PRO
3
31k
大規模言語モデルの論理構造の把握能力と予測モデルの生成
fuyu_quant0
0
110
Planted Clique Conjectures are Equivalent
nobushimi
0
120
局所保存性・相似変換対称性を満たす機械学習モデルによる数値流体力学
yellowshippo
1
180
テンソル分解を用いた教師なし学習による変数選択法のシングルセルマルチオミックスデータ解析への応用
tagtag
1
120
Featured
See All Featured
Bash Introduction
62gerente
611
210k
YesSQL, Process and Tooling at Scale
rocio
172
14k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
40
2k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
330
21k
What's in a price? How to price your products and services
michaelherold
244
12k
A designer walks into a library…
pauljervisheath
205
24k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
32
2.2k
Producing Creativity
orderedlist
PRO
344
40k
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
7
660
Statistics for Hackers
jakevdp
797
220k
The MySQL Ecosystem @ GitHub 2015
samlambert
251
12k
Speed Design
sergeychernyshev
28
820
Transcript
Why People Search for Images using Web Search Engines WSDM
2018 จಡΈձ 20180414 ٠ా ངฏ (@yohei_kikuta) Event URL: https://atnd.org/events/95510, paper: https://arxiv.org/abs/1711.09559
·ͱΊ 1. text base ͷΣϒը૾ݕࡧͷҙਤྨՄೳ͔ʁ → YES. 3ͭʹྨ: Entertain, Explore/Learn,
Locate/Acquire 2. औಘՄೳͳಛྔ͔ΒҙਤΛผͰ͖Δ͔ʁ → YES. ཹ࣌ؒϚεδΣενϟ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ → MAYBE. ಛྔΛͬͯϞσϧΛ࡞ͯ͠Ұఆͷੑೳ 2
എܠ 3
ݕࡧͷҙਤΛΓ͍ͨ Ϣʔβͷݕࡧߦಈͷཪʹ͋ΔҙਤΛΔ͜ͱॏཁ → Ϣʔβͷຬ্ʢsuggestion, recommendation, ...ʣ Σϒݕࡧͷݚڀͳ͞Ε͖͕ͯͨɺը૾ݕࡧʹؔͯ͠ݶఆత → ΫΤϦϕʔε →
͔͠͠ը૾ݕࡧͷΫΤϦ͘ͳΓ͕ͪͰෆ࣮֬ੑ͕େ͖͍ ຊจͰηογϣϯใΛѻͬͯը૾ݕࡧͷҙਤΛݚڀ 4
ຊจʹ͓͚ΔϦαʔνΫΤενϣϯ 1. text base ͷΣϒը૾ݕࡧͷҙਤྨՄೳ͔ʁ 2. औಘՄೳͳಛྔ͔ΒҙਤΛผͰ͖Δ͔ʁ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ 5
ઌߦݚڀ 6
Σϒݕࡧʹ͓͚Δҙਤͷ taxonomy A taxonomy of web search (2002) ͰҙਤΛ3ͭʹྨ 1.
Navigational: ಛఆͷαΠτ౸ୡ 2. Informational: ใͷऔಘ 3. Transactional: ΣϒΛഔհͱͨ͠׆ಈ Ref: https://dl.acm.org/citation.cfm?id=792552 7
Σϒݕࡧʹ͓͚Δҙਤͷ taxonomy Task Behaviors During Web Search: The Difficulty of
Assigning Labels (2009) ͰݕࡧλεΫΛ7ͭʹྨ » Navigate, Find-Simple, Find-Complex, Locate/Acquire, Explore/Learn, Play, Meta ຊจ͜ͷઌߦݚڀΛ౿ऻͭͭ͠ը૾ݕࡧʹൃలͤͨ͞ͷɺͱ͍͏ ৭߹͍͕ڧ͍ Ref: http://ieeexplore.ieee.org/document/4755491/ 8
ը૾ݕࡧͷҙਤΛྨ 9
Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷΣϒݚڀऀ͕ྨ » ϢʔβͷΞϯέʔτσʔλ » ੑผใͳͲΛऔಘ » ࠷ۙͷݕࡧʹؔ͢ΔৄࡉʢಈػͳͲʣɺ༻ͨ͠ΫΤϦ » దͳճΛͨ͠211ਓ͕ର
10
Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷΣϒݚڀऀ͕ྨ » ϩάσʔλ » https://www.sogou.com/ ͷϩάσʔλ » 30Ҏʹ࿈ଓతͳΫΤϦΛ༩͍͑ͯΔ475ηογϣϯʢআ͘Ξμ ϧτʣ
11
Ξϓϩʔν σʔλΛूΊͯͦΕΛجʹ3ਓͷΣϒݚڀऀ͕ྨ » ϩάσʔλʢlength ΫΤϦʣ Ref: https://arxiv.org/abs/1711.09559 12
࡞ͨ͠அج४ 1. Ϣʔβͷݕࡧߦಈ໌֬ͳతʹґΔͷ͔ʁ 2. ޙͷར༻ͷͨΊʹը૾Λμϯϩʔυ͢Δඞཁ͕͋Δ͔ʁ 13
3ͭͷݕࡧҙਤ 1. Explore/Learn (1-yes, 2-no) ྫʣΰϦϥͱϘϊϘͷݟͨͷҧ͍ΛνΣοΫ 2. Locate/Acquire (1-yes, 2-yes)
ྫʣϨϙʔτ࡞Ͱ͏ΰϦϥͷը૾Λ୳ͯ͠μϯϩʔυ 3. Entertain (1-no, 2-yes or no) ྫʣΰϦϥͷ໘നը૾ΛோΊΔ 14
3ͭͷݕࡧҙਤʢྫʣ Ref: https://arxiv.org/abs/1711.09559 15
ଥੑͷݕূʢ3ਓͷେֶӃੜʹΑΔҙਤྨʣ » ϢʔβͷΞϯέʔτσʔλ Fleiss' kappa: 0.673 Explore/Learn: 27%, Locate/Acquire: 66%,
Entertain: 7% » ϩάσʔλʢΫΤϦͷΈΛ༻ʣ Fleiss' kappa: 0.375 Explore/Learn: 56%, Locate/Acquire: 39%, Entertain: 5% ͏·͚͘Εͦ͏͕ͩΫΤϦͷΈͰҙਤΛΉͷ͍͠ 16
औಘՄೳͳಛྔͰҙਤΛผ 17
35ਓͷֶ෦ੜʹΑΔ12ݸͷը૾ݕࡧλεΫ ྫʣPCͷഎܠΛ੨ۭͱͷը૾ʹมߋʢLocate/Acquireʣ ͦͷࡍʹҎԼͷಛྔΛऔಘ Ref: https://arxiv.org/abs/1711.09559 18
ҙਤʹΑͬͯ༗ҙͳ͕ࠩग़ΔͷͰผՄೳ ఀཹ࣌ؒ E/L ͕ଟ͍ɺϚεΫϦοΫ E/L < L/A < EɺͳͲ ʢৄࡉจΛࢀরʣ
Ref: https://arxiv.org/abs/1711.09559 19
ηογϣϯॳظͰͷҙਤͷ༧ଌ 20
ઃఆ ηογϣϯॳظͱʮ࠷ॳͷϚεεΫϩʔϧ͕͋Δ·Ͱʯ ༧ଌͰ͏ feature ͱͯ͠ҎԼͷҙ - ΫϦοΫͱ࠷ॳͷϚεΦʔόʔ࣌ؒΘͳ͍ - ΫΤϦϕʔεͰΓ͍ͨͷͰ query
reformulation Θͳ͍ ֶ෦ੜʹղ͔ͤͨը૾ݕࡧλεΫʹରͯ͠ GBDT Ͱ 10-fold CV 21
༧ଌੑೳߴ͘ͳ͍͕ෆՄೳͰͳͦ͞͏ Baseline majority ʹશ෦دͤΔͱ͍͏ͷ Ref: https://arxiv.org/abs/1711.09559 22
·ͱΊͱॴײ 23
·ͱΊʢ࠶ܝʣ 1. text base ͷΣϒը૾ݕࡧͷҙਤྨՄೳ͔ʁ → YES. 3ͭʹྨ: Entertain, Explore/Learn,
Locate/Acquire 2. औಘՄೳͳಛྔ͔ΒҙਤΛผͰ͖Δ͔ʁ → YES. ཹ࣌ؒϚεδΣενϟ 3. ηογϣϯॳظͰݕࡧҙਤΛ༧ଌͰ͖Δ͔ → MAYBE. ಛྔΛͬͯϞσϧΛ࡞ͯ͠Ұఆͷੑೳ 24
ॴײ » γϯϓϧͳج४ͰྨΛ͍ͯ͠Δͱ͍͏ͷྑ͍ » ৽ͱ͍͏Θ͚Ͱͳ͍͕ҰͭҰ͔ͭͬ͠Γௐ͍ͯΔ » ࣮αʔϏεͷԠ༻ʹҰาඈ༂͕ඞཁͦ͏ʢ༧ଌੑೳͳͲʣ » ٱʑʹࣜΛશવΘͳ͍จΛಡΜͰ৽ͩͬͨ 25