20180414_WSDM2018_reading_YoheiKIKUTA
yoppe
April 12, 2018
HP:
https://atnd.org/events/95510
Transcript
Why People Search for Images using Web Search Engines
WSDM 2018 paper reading meetup, 20180414
Yohei Kikuta (@yohei_kikuta)
Event URL: https://atnd.org/events/95510, paper: https://arxiv.org/abs/1711.09559
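Summary
1. Can the intents behind text-based web image search be classified? → YES. Three classes: Entertain, Explore/Learn, Locate/Acquire
2. Can the intents be distinguished from obtainable features? → YES. Dwell time, mouse gestures, etc.
3. Can search intent be predicted early in a session? → MAYBE. A model built on these features reaches a moderate level of performance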
Background
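We want to know the intent behind a search
Understanding the intent behind a user's search behavior is important
→ Higher user satisfaction (suggestion, recommendation, ...)
Web search has been studied extensively, but work on image search is limited
→ Query-based
→ However, image search queries tend to be short, so uncertainty is large
This paper also uses session information to study image search intent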
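Research questions in this paper
1. Can the intents behind text-based web image search be classified?
2. Can the intents be distinguished from obtainable features?
3. Can search intent be predicted early in a session?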
Prior work
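Taxonomy of intent in web search
A taxonomy of web search (2002) classifies intent into three types
1. Navigational: reaching a specific site
2. Informational: obtaining information
3. Transactional: performing an activity mediated by the web
Ref: https://dl.acm.org/citation.cfm?id=792552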
Taxonomy of intent in web search
Task Behaviors During Web Search: The Difficulty of Assigning Labels (2009) classifies search tasks into seven types
» Navigate, Find-Simple, Find-Complex, Locate/Acquire, Explore/Learn, Play, Meta
This paper largely follows that prior work and extends it to image search
Ref: http://ieeexplore.ieee.org/document/4755491/
Classifying image search intent
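Approach
Data was collected, and three web researchers classified intents based on it
» User survey data
  » Demographic information such as gender was collected
  » Details of a recent search (motivation, etc.) and the queries used
  » The 211 respondents who gave appropriate answers were included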
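Approach
Data was collected, and three web researchers classified intents based on it
» Log data
  » Log data from https://www.sogou.com/
  » 475 sessions with consecutive queries issued within 30 minutes (adult content excluded)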
Approach
Data was collected, and three web researchers classified intents based on it
» Log data (length is the number of queries)
Ref: https://arxiv.org/abs/1711.09559
Decision criteria that were defined
1. Is the user's search behavior driven by a clear objective?
2. Does the user need to download the image for later use?
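Three search intents
1. Explore/Learn (1-yes, 2-no)
   e.g. checking how gorillas and bonobos differ in appearance
2. Locate/Acquire (1-yes, 2-yes)
   e.g. finding and downloading a gorilla image to use in a report
3. Entertain (1-no, 2-yes or no)
   e.g. browsing funny gorilla pictures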
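The mapping from the two criteria to the three intents can be written as a small decision rule. This is a minimal sketch of the rule as stated on the slide, not code from the paper; the function name `classify_intent` and its boolean parameters are hypothetical.

```python
def classify_intent(clear_objective: bool, needs_download: bool) -> str:
    """Map the two yes/no criteria to one of the three intents.

    clear_objective: answer to criterion 1 (hypothetical parameter name).
    needs_download:  answer to criterion 2 (hypothetical parameter name).
    """
    if not clear_objective:
        # Criterion 1 = no -> Entertain, whatever the answer to criterion 2.
        return "Entertain"
    # Criterion 1 = yes: criterion 2 separates the remaining two intents.
    return "Locate/Acquire" if needs_download else "Explore/Learn"


if __name__ == "__main__":
    for c1 in (True, False):
        for c2 in (True, False):
            print(c1, c2, classify_intent(c1, c2))
```

Because Entertain accepts either answer to criterion 2, criterion 1 is checked first and criterion 2 is only consulted when a clear objective exists.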
Three search intents (examples)
Ref: https://arxiv.org/abs/1711.09559
Validity check (intent classification by three graduate students)
» User survey data
  Fleiss' kappa: 0.673
  Explore/Learn: 27%, Locate/Acquire: 66%, Entertain: 7%
» Log data (queries only)
  Fleiss' kappa: 0.375
  Explore/Learn: 56%, Locate/Acquire: 39%, Entertain: 5%
The classes look separable, but grasping intent from the query alone is difficult
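For readers unfamiliar with the agreement statistic quoted above, here is a minimal sketch of how Fleiss' kappa is computed from a table of rating counts. The toy matrix is made up for illustration and is not the paper's annotation data; `fleiss_kappa` is a hypothetical helper name.

```python
from typing import List, Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """Fleiss' kappa for counts[i][j] = number of raters who put item i in category j.

    Every item must be rated by the same number of raters (at least 2).
    The value is undefined (division by zero) if all ratings fall in one category.
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item needs the same number of ratings")
    n_categories = len(counts[0])

    # Observed agreement: share of agreeing rater pairs per item, averaged over items.
    per_item: List[float] = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement from the overall category proportions.
    proportions = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in proportions)

    return (p_observed - p_expected) / (1.0 - p_expected)


if __name__ == "__main__":
    # Toy data: 4 items, 3 raters, 3 categories (illustrative only).
    toy = [[3, 0, 0], [0, 3, 0], [2, 1, 0], [0, 2, 1]]
    print(round(fleiss_kappa(toy), 3))
```

A value of 1 means perfect agreement and 0 means agreement at chance level, which is why 0.375 for the query-only setting reads as noticeably weaker than 0.673 for the survey data.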
Distinguishing intents with obtainable features
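12 image search tasks performed by 35 undergraduate students
e.g. changing the PC wallpaper to an image of blue sky and […] (Locate/Acquire)
The following features were recorded during the tasks (see Ref)
Ref: https://arxiv.org/abs/1711.09559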
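The intents can be distinguished because the features differ significantly between them
Dwell time is longest for E/L, mouse clicks order as E/L < L/A < E, and so on
(see the paper for details)
Ref: https://arxiv.org/abs/1711.09559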
Predicting intent early in a session
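Setup
"Early in the session" means "until the first mouse scroll occurs"
Notes on the features used for prediction:
- Clicks and the time to first mouse hover are not used
- Query reformulation is not used, because the prediction is meant to be query-based
GBDT with 10-fold CV on the image search tasks solved by the undergraduate students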
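The "early session" restriction above can be pictured as a truncation of the interaction log plus a feature filter. The sketch below assumes a hypothetical log format of `(seconds, event_type)` tuples and hypothetical feature names; neither comes from the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical log record: (seconds since the query was issued, event type).
Event = Tuple[float, str]

# Hypothetical names for the features the slide says are left out.
EXCLUDED_FEATURES = {"click_count", "time_to_first_hover", "query_reformulation"}


def early_session_events(events: List[Event]) -> List[Event]:
    """Keep only the events that happen before the first mouse scroll."""
    kept: List[Event] = []
    for timestamp, kind in sorted(events):
        if kind == "scroll":
            break  # the early part of the session ends at the first scroll
        kept.append((timestamp, kind))
    return kept


def usable_features(features: Dict[str, float]) -> Dict[str, float]:
    """Drop the features that are excluded from the prediction model."""
    return {name: value for name, value in features.items()
            if name not in EXCLUDED_FEATURES}


if __name__ == "__main__":
    log = [(0.0, "query"), (1.2, "mousemove"), (3.4, "scroll"), (5.0, "click")]
    print(early_session_events(log))
    print(usable_features({"dwell_time": 8.5, "click_count": 2.0}))
```

If a session contains no scroll at all, this sketch keeps every event; how the paper treats that case is not stated on the slide.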
Prediction performance is not high, but the task does not look impossible
The baseline assigns everything to the majority class
Ref: https://arxiv.org/abs/1711.09559
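To make the baseline concrete, here is a minimal sketch of a majority-class predictor evaluated with k-fold cross-validation. The interleaved fold assignment and the toy labels are assumptions for illustration; the paper's exact fold construction is not given on the slide.

```python
from collections import Counter
from typing import List, Sequence


def k_fold_indices(n: int, k: int) -> List[List[int]]:
    """Split indices 0..n-1 into k interleaved folds (no shuffling)."""
    return [list(range(start, n, k)) for start in range(k)]


def majority_baseline_cv(labels: Sequence[str], k: int = 10) -> float:
    """Mean accuracy of always predicting the training folds' most common label."""
    n = len(labels)
    if n < k:
        raise ValueError("need at least k labelled examples")
    accuracies: List[float] = []
    for test_idx in k_fold_indices(n, k):
        held_out = set(test_idx)
        train = [labels[i] for i in range(n) if i not in held_out]
        majority = Counter(train).most_common(1)[0][0]
        correct = sum(labels[i] == majority for i in test_idx)
        accuracies.append(correct / len(test_idx))
    return sum(accuracies) / k


if __name__ == "__main__":
    # Toy labels, not the paper's data.
    toy_labels = ["L/A"] * 60 + ["E/L"] * 30 + ["Entertain"] * 10
    print(majority_baseline_cv(toy_labels))
```

On data like this the baseline accuracy is simply the share of the largest class, so a learned model has to beat that share to show it extracts any signal from the features.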
Summary and impressions
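Summary (recap)
1. Can the intents behind text-based web image search be classified? → YES. Three classes: Entertain, Explore/Learn, Locate/Acquire
2. Can the intents be distinguished from obtainable features? → YES. Dwell time, mouse gestures, etc.
3. Can search intent be predicted early in a session? → MAYBE. A model built on these features reaches a moderate level of performance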
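Impressions
» Classifying with simple criteria is a good approach
» The work is not especially novel, but each point is investigated carefully
» Applying it to a real service would take another leap (prediction performance, etc.)
» It was refreshing to read, for the first time in a while, a paper that uses no equations at all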