Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Review: "Recommending Investors for Crowdfundin...
Search
yag_ays
July 09, 2014
Research
1.2k
1
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Review: "Recommending Investors for Crowdfunding Projects"
http://yagays.github.io/blog/2014/07/09/www2014review-kickstarter/
yag_ays
July 09, 2014
More Decks by yag_ays
See All by yag_ays
対話型AIの構築における工夫とデータセットの重要性 - 素早くデータを構築し検証するためには
yag_ays
3
7.1k
目と耳を持った自然言語処理 - スタートアップにおける価値創出のために
yag_ays
1
3.9k
時間情報表現抽出とルールベース解析器のこれから / Temporal Expression Analysis in Japanese and Future of Rule-based Approach
yag_ays
1
2.3k
Pythonで始める ドキュメント・インテリジェンス入門 / Introduction to Document Intelligence with Python
yag_ays
9
9.2k
"医者の言葉、患者の言葉、エンジニアの言葉" / MNTSQ Ubie Vertical ai
yag_ays
3
14k
LT at nlp_career
yag_ays
0
350
Other Decks in Research
See All in Research
2026年度 生成AI を活用した論文執筆ガイド/ワークショップ / 2026 Academic Year Guide to Writing Papers Using Generative AI - Workshop
ks91
PRO
0
170
Apache Gravitinoで実現する Icebergカタログ統合とアクセスの一元化
matsumooon
0
290
非試合日の野球場を楽しむためのARホームランボールキャッチ体験システムの開発 / EC79-miyazaki
yumulab
0
230
LiDAR点群の地表面分類手法の比較・検証
vegapunkhiroshi79
0
120
AGI4OPT:自然言語から数理最適化を導くエ ージェントスキル Translating Human Intent into Mathematical Optimization
mickey_kubo
0
140
「車1割削減、渋滞半減、公共交通2倍」を 熊本から岡山へ@RACDA設立30周年記念都市交通フォーラム2026
trafficbrain
1
1.2k
データセンター事業者を取り巻く近年の状況とその中での研究開発動向、テストベッドへの貢献の可能性
kikuzo
1
190
重要だけど測れていないもの:高齢者ケアの見えない課題
theoriatec2024
0
350
量子コンピュータの紹介
oqtopus
0
330
COFFEE-Japan PROJECT Impact Report(Uminomukou Coffee)
ontheslope
0
190
敵対生成プロンプト同時探索による内省型プロンプト最適化
kinoue_smarthr
0
210
進学校の生徒にはア行の苗字が多いのか
ozekinote
0
450
Featured
See All Featured
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
The Language of Interfaces
destraynor
162
27k
Google's AI Overviews - The New Search
badams
0
1k
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
230
The Illustrated Guide to Node.js - THAT Conference 2024
reverentgeek
1
390
Principles of Awesome APIs and How to Build Them.
keavy
128
18k
A brief & incomplete history of UX Design for the World Wide Web: 1989–2019
jct
2
400
The Director’s Chair: Orchestrating AI for Truly Effective Learning
tmiket
1
190
Fireside Chat
paigeccino
42
4k
A Modern Web Designer's Workflow
chriscoyier
698
190k
What Being in a Rock Band Can Teach Us About Real World SEO
427marketing
0
250
Lessons Learnt from Crawling 1000+ Websites
charlesmeaden
PRO
1
1.3k
Transcript
Recommending Investors for Crowdfunding Projects WWW 2014 Jisun An, Daniele
Quercia, Jon Crowcroft จհ @yag_ays 1
ࠓճհ͢Δจͷ֓ཁ • “Recommending Investors for Crowdfunding Project” [Jisun+ 2014] •
WWW 2014 (Seoul, KOREA) • Jisun AnͷYahoo Labs in Barcelona Πϯλʔϯγοϓͷࣄ ! • ࠷ऴతͳඪɿKickstarterͷϑΝϯμʔͱग़ࢿऀͷϚονϯά • KickstarterͷϓϩδΣΫτग़ࢿऀͷੳϝΠϯͳͱ͜Ζ͕͋Δ 2
• ΫϥυϑΝϯσΟϯά • 2012ʹूΊͨࢿ૯ֹ$320 million • c.f. ࢿՈ/ϕϯνϟʔΩϟϐλϧʹΑΔग़ࢿ ! •
ϑΝϯμʔඪֹۚΛઃఆͯ͠ࢿΛืΔ • ࢿՈࢿֹۚʹԠͯ͡ใु͕Β͑Δ • e.g. $100ࢿͰ1ݸϓϨθϯτɼ$300ࢿͰ5ݸϓϨθϯτ Kickstarterͱ https://www.kickstarter.com/help/style_guide 3
Kickstarterޭࣄྫ • Oculus Rift • 9,522 / $ 2,437,429 •
Memoto (Narrative Clip) • 2,871 / $ 550,189 • Little Witch Academia 2 • 7,938 / $ 625,518 https://www.kickstarter.com/projects/1523379957/oculus-rift-step-into-the-game https://www.kickstarter.com/projects/martinkallstrom/memoto-lifelogging-camera https://www.kickstarter.com/projects/1311401276/little-witch-academia-2 4
Kickstarterͷಛघੑɿࢿʹࣦഊͨ͘͠ͳ͍͚Ͳ… • All or Nothing • ඪֹۚʹୡ͠ͳ͚ΕϓϩδΣΫτࣦഊɼࢿۚશֹฦ٫ • ϓϩδΣΫτͷޭ/ࣦഊʹؔΘΒͣɼࢿऀଛΛ͠ͳ͍ !
• ιʔγϟϧͳଆ໘ • ॳظࢿʹ༑ୡ͕ଟ͍ʢ20-40%ͱ͍͏ࢉग़ʣ • ेͳࢿՈΛूΊΒΕͳ͍ͱϓϩδΣΫτ͕ࣦഊ͍͢͠ 5
จͷྲྀΕ • Kickstarterʹ͓͚ΔࢿՈͷڍಈʹ͍ͭͯԾઆΛཱͯΔ • KickstarterTwitterͷใ͔ΒԾઆΛݕূ͢Δ • ࣗಈతʹϑΝϯμʔͱࢿՈͷϚονϯάΛߦ͏ϞσϧΛཱͯΔ 6
Kickstarterͷσʔλऩू/ղੳ Dataset and Pledging Behavior 7
σʔληοτ • Kickstarter͔ΒΫϩʔϧ • 20137݄͔Β10݄ʹొ͞Εͨͷ • USA෦ͷϑΝϯυͷΈ • ߹ܭ 1,149ϓϩδΣΫτ/
78,460ग़ࢿऀ • Twitter͔ΒΫϩʔϧ • ϓϩδΣΫτʹݴٴ͢ΔtweetͷΈ • ߹ܭ71,315 tweetΛऩू 8 (Average)
ࢿՈͷߏ • ߹ܭ78,460ਓͷग़ࢿऀ ! • 4ճະຬͷࢿΛͨ͠ਓ51% • ؾ·͙Εࢿऀ “Occasional Investors”
• 32ճҎ্ͷࢿΛͨ͠ਓ11% • ৗ࿈ࢿऀ “Frequent Investors” ؾ·͙ΕࢿՈ ৗ࿈ࢿՈ 9
ϓϩδΣΫτͷΧςΰϦʔ͝ͱͷࢿऀͷ༁ • Music, DanceͳͲ୯ൃͷग़ࢿ͕ଟ͍ • Gamesৗ࿈ࢿՈ͕ଟ͍ˠେنͳήʔϜ։ൃͷืूͳͲ 10
ࢿՈͷڍಈʹؔ͢ΔԾઆ • ৗ࿈ࢿՈҎԼͷΑ͏ͳੑ࣭ͷϓϩδΣΫτʹࢿ͍͢͠ • ใ͕සൟʹΞοϓσʔτ͞ΕΔ • ϑΝϯμʔ͕ࢿՈͷ࣭ʹ͑Δ • ࢿͷใु͕ྑ͍ •
ߴ͍ࢿֹۚͷϓϩδΣΫτৗ࿈ࢿՈʹࢿ͞Ε͍͢ • ϩʔΧϧͳϓϩδΣΫτؾ·͙ΕࢿՈʹࢿ͞Ε͍͢ • ૣ͘ࢿΛूΊΔϓϩδΣΫτৗ࿈ࢿՈʹࢿ͞Ε͍͢ • ৗ࿈ࢿՈࣗͷڵຯ͋ΔϓϩδΣΫτʹࢿ͍͢͠ 11
ϓϩδΣΫτͰ͢Δಛྔ • ϓϩδΣΫτͷߋ৽ • ϑΝϯμʔͷίϝϯτ • ใुͷϨϕϧ • ΣϒαΠτͷ༗ແ •
ඪֹۚ ($) • ཧతͳڑͷΒ͖ͭ • ϓϩδΣΫτͷ 12
ͦΕͧΕͷಛྔ͝ͱͷࢿऀͷ༁ ϓϩδΣΫτͷߋ৽ ϑΝϯμʔͷίϝϯτ ใुͷϨϕϧ 13 ಛྔͷ͕૿͑Δ΄Ͳʹৗ࿈ࢿՈͷׂ߹͕૿Ճ͢Δ
ͦΕͧΕͷಛྔ͝ͱͷࢿऀͷ༁ (cont’d) ඪֹۚ ($) ཧతͳڑͷΒ͖ͭ ϓϩδΣΫτͷ 14 ඪֹ͕ۚ૿Ճ͢Δ΄Ͳʹ ؾ·͙ΕࢿՈͷׂ߹͕ݮগ͢Δ ؾ·͙ΕࢿՈͷࢿ
ʹӨڹ͞Εͳ͍
ԾઆɿࢿՈͷڵຯͱࢿઌͷؔ • LDA (Latent Dirichlet Allocation) ΛͬͨτϐοΫͷྨࣅ • ࢿͨ͠ϓϩδΣΫτͷ֓ཁͱࢿՈͷTweetͷ༰ (200
tweetsఔ) • (τϐοΫṖ) ! • ৗ࿈ࢿՈࣗͷڵຯ͋ΔෳͷτϐοΫʹࢿ͕ͪ͠ • ؾ·͙ΕࢿՈࣗͷڵຯͱؔͳ͍τϐοΫʹࢿ or ͻͱͭͷτ ϐοΫʹภͬͯࢿ͍ͯ͠Δ 15
͜͜·Ͱͷ·ͱΊ • ৗ࿈ࢿՈ (4ϲ݄ؒʹ32ճҎ্ࢿͨ͠Frequent Investors) • Α͘Ϛωʔδϝϯτ͞Εɼඪֹ͕ۚߴ͘ɼࣗͷڵຯʹ߹͏ϓϩδΣ Ϋτʹࢿ͢Δʹ͋Δ • ௨ৗͷࢿՈͷΑ͏ͳڍಈΛࣔ͢
• ؾ·͙ΕࢿՈ (4ϲ݄ؒʹ4ճະຬͷࢿΛͨ͠Occasional Investors) • ࠓճબΜͩಛྔʹ͋·ΓӨڹ͞ΕͣࢿΛߦ͏ • ࢿͱ͍͏ΑΓدͱ͍͏ײ͡ 16
ืूऀͷ༑ୡ͕ଟ͍΄Ͳ୯ൃͷग़ࢿׂ߹͕૿͑Δ • ϑΝϯμʔͷFacebookͷ༑ୡͷͱ ࢿՈͷ༁ͷؔ • ҼՌؔෆ໌ ! • Facebookͷ༑ୡͷ͕ଟ͍΄Ͳؾ·͙ ΕࢿՈΛूΊ͍͢
• Facebookͷ༑ୡͷ͕গͳ͍΄Ͳৗ࿈ ࢿՈΛूΊ͍͢…??? 17
ϑΝϯμʔͱग़ࢿऀͷϚονϯά Recommending Investors 18
ϓϩδΣΫτͱࢿՈͷϚονϯάํ๏ • Twitterʹ͍ΔજࡏతͳࢿՈʹରͯ͠ϓϩδΣΫτΛਪન͢Δ • KickstarterͷϢʔβ໊͔ΒTwitterͷΞΧϯτΛඥ͚ • 7,429ਓͷࢿՈ͕891ͷϓϩδΣΫτʹࢿͨ͠σʔλΛݩʹਪન • ࢿऀ͕ࢿ͢Δ= 1ɼࢿऀ͕ࢿ͠ͳ͍
= 0ͱͨ͠ೋྨ ! • Ϋϩʔϧͨ͠σʔλਖ਼ྫͷΈͳͷͰɼϥϯμϜͰෛྫΛࠞͥΔ • ਖ਼ྫෛྫͷׂ߹50-50 19
Ϩίϝϯσʔγϣϯͷख๏ͱධՁํ๏ • ੑೳධՁ͢Δख๏4ͭ : {LR, SVM-linear, SVM-poly, SVM-RBF} • ϩδεςΟοΫճؼʢLRʣ
• 3छྨͷΧʔωϧΛ༻͍ͨSVM ʢLinear, polynomial, RBFʣ ! • ධՁɿ5-fold cross validation • σʔληοτͷ80%Ͱֶशˠ20%ͰධՁ Λ5ճ܁Γฦͯ͠ධՁΛฏۉ 20
༻͢Δಛྔ • Static Feature: ϓϩδΣΫτൃ࣌ʹΘ͔Δಛྔ • ඪֹۚɾใुͷϨϕϧɾաڈʹࢿͨ͠ϓϩδΣΫτͷΧςΰϦɾ TwitterͷߘʹΑΔࢿՈͷڵຯ • Dynamic
Feature: ϓϩδΣΫτͷਐߦʹΑͬͯ໌ͯ͘͠Δಛྔ • ϓϩδΣΫτͷޭɾߋ৽ɾίϝϯτɾཧతͳڑͷΒ͖ͭ 21
ϨίϝϯσʔγϣϯͷධՁ • RBFΧʔωϧΛ༻͍ͨSVM • Static͚ͩͷಛྔ ɿ82% • Dynamic͚ͩͷಛྔɿ73% ! •
StaticͱDynamicΛ߹ΘͤΔͱACC 84% ! • ਖ਼ྫෛྫͷׂ߹͕50-50ɿϕʔεϥΠϯ50% ACC : Accuracy P : Precision R : Recall F1 : F-score AUC : ROCۂઢԼͷ໘ੵ 22
Ͳͷಛྔ͕ޮ͍͍ͯΔͷ͔ʁ • C : ίϝϯτ • R : ใुͷϨϕϧ •
S : ཧతͳڑͷΒ͖ͭ • G: • E : ΧςΰϦʔͷҰக • TS: ڵຯ͋ΔτϐοΫͱͷྨࣅ →EͱTS͕ਫ਼্ʹد༩͍ͯ͠Δ 23
จͷ·ͱΊ • ࢿՈʹΑͬͯKickstarterͷࢿελΠϧ͕ҧ͏ • ৗ࿈ࢿՈ৺తͳϓϩδΣΫτʹࢿ͢Δ • ؾ·͙ΕࢿՈدײ֮Ͱࢿ / ܳज़ʹؔ࿈ͨ͠ϓϩδΣΫτʹࢿ •
ࢿՈͱϓϩδΣΫτͷϚονϯάՄೳ • ࢿՈ͕ϓϩδΣΫτʹࢿΛ͢Δ͔Ͳ͏͔84%ͷਫ਼ͰਪଌՄೳ • ࢿՈͷڵຯ͋ΔΧςΰϦʔ༰͕Ϛονϯάʹڧ͘Өڹ͢Δ 24