Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Review: "Recommending Investors for Crowdfundin...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
yag_ays
July 09, 2014
Research
1.2k
1
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Review: "Recommending Investors for Crowdfunding Projects"
http://yagays.github.io/blog/2014/07/09/www2014review-kickstarter/
yag_ays
July 09, 2014
More Decks by yag_ays
See All by yag_ays
対話型AIの構築における工夫とデータセットの重要性 - 素早くデータを構築し検証するためには
yag_ays
3
7.1k
目と耳を持った自然言語処理 - スタートアップにおける価値創出のために
yag_ays
1
3.9k
時間情報表現抽出とルールベース解析器のこれから / Temporal Expression Analysis in Japanese and Future of Rule-based Approach
yag_ays
1
2.3k
Pythonで始める ドキュメント・インテリジェンス入門 / Introduction to Document Intelligence with Python
yag_ays
9
9.2k
"医者の言葉、患者の言葉、エンジニアの言葉" / MNTSQ Ubie Vertical ai
yag_ays
3
14k
LT at nlp_career
yag_ays
0
350
Other Decks in Research
See All in Research
AI Agentの精度改善に見るML開発との共通点 / commonalities in accuracy improvements in agentic era
shimacos
6
1.7k
Cross-Media Information Spaces and Architectures
signer
PRO
0
300
SOTAのさらに先へ:厳しい推論制約下での高性能モデルのPost-Training
analokmaus
0
1.3k
2026 東京科学大 情報通信系 研究室紹介 (大岡山)
icttitech
0
3.8k
量子コンピュータの紹介
oqtopus
0
330
Any-Optical-Model: A Universal Foundation Model for Optical Remote Sensing
satai
3
830
Language and AI
ayaniwa
0
120
多様なデータを許容し学習し続ける模倣学習 / Advanced Imitation Learning for VLA
prinlab
0
220
業界横断 副業コンプライアンス調査 三者(副業者・本業先・発注者)におけるトラブル認知ギャップの構造分析
fkske
0
1.3k
言語モデルから言語について語る際に押さえておきたいこと
eumesy
PRO
5
2.3k
Fukui Shibiten 39 - AI Art
butchi
0
120
LLMアプリケーションの透明性について
fufufukakaka
0
240
Featured
See All Featured
RailsConf 2023
tenderlove
30
1.5k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
Are puppies a ranking factor?
jonoalderson
1
3.6k
Odyssey Design
rkendrick25
PRO
2
700
Bridging the Design Gap: How Collaborative Modelling removes blockers to flow between stakeholders and teams @FastFlow conf
baasie
0
590
Conquering PDFs: document understanding beyond plain text
inesmontani
PRO
4
2.8k
SEOcharity - Dark patterns in SEO and UX: How to avoid them and build a more ethical web
sarafernandez
0
200
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
55k
Mobile First: as difficult as doing things right
swwweet
225
10k
Mind Mapping
helmedeiros
PRO
1
250
Rebuilding a faster, lazier Slack
samanthasiow
85
9.5k
Building an army of robots
kneath
306
46k
Transcript
Recommending Investors for Crowdfunding Projects WWW 2014 Jisun An, Daniele
Quercia, Jon Crowcroft จհ @yag_ays 1
ࠓճհ͢Δจͷ֓ཁ • “Recommending Investors for Crowdfunding Project” [Jisun+ 2014] •
WWW 2014 (Seoul, KOREA) • Jisun AnͷYahoo Labs in Barcelona Πϯλʔϯγοϓͷࣄ ! • ࠷ऴతͳඪɿKickstarterͷϑΝϯμʔͱग़ࢿऀͷϚονϯά • KickstarterͷϓϩδΣΫτग़ࢿऀͷੳϝΠϯͳͱ͜Ζ͕͋Δ 2
• ΫϥυϑΝϯσΟϯά • 2012ʹूΊͨࢿ૯ֹ$320 million • c.f. ࢿՈ/ϕϯνϟʔΩϟϐλϧʹΑΔग़ࢿ ! •
ϑΝϯμʔඪֹۚΛઃఆͯ͠ࢿΛืΔ • ࢿՈࢿֹۚʹԠͯ͡ใु͕Β͑Δ • e.g. $100ࢿͰ1ݸϓϨθϯτɼ$300ࢿͰ5ݸϓϨθϯτ Kickstarterͱ https://www.kickstarter.com/help/style_guide 3
Kickstarterޭࣄྫ • Oculus Rift • 9,522 / $ 2,437,429 •
Memoto (Narrative Clip) • 2,871 / $ 550,189 • Little Witch Academia 2 • 7,938 / $ 625,518 https://www.kickstarter.com/projects/1523379957/oculus-rift-step-into-the-game https://www.kickstarter.com/projects/martinkallstrom/memoto-lifelogging-camera https://www.kickstarter.com/projects/1311401276/little-witch-academia-2 4
Kickstarterͷಛघੑɿࢿʹࣦഊͨ͘͠ͳ͍͚Ͳ… • All or Nothing • ඪֹۚʹୡ͠ͳ͚ΕϓϩδΣΫτࣦഊɼࢿۚશֹฦ٫ • ϓϩδΣΫτͷޭ/ࣦഊʹؔΘΒͣɼࢿऀଛΛ͠ͳ͍ !
• ιʔγϟϧͳଆ໘ • ॳظࢿʹ༑ୡ͕ଟ͍ʢ20-40%ͱ͍͏ࢉग़ʣ • ेͳࢿՈΛूΊΒΕͳ͍ͱϓϩδΣΫτ͕ࣦഊ͍͢͠ 5
จͷྲྀΕ • Kickstarterʹ͓͚ΔࢿՈͷڍಈʹ͍ͭͯԾઆΛཱͯΔ • KickstarterTwitterͷใ͔ΒԾઆΛݕূ͢Δ • ࣗಈతʹϑΝϯμʔͱࢿՈͷϚονϯάΛߦ͏ϞσϧΛཱͯΔ 6
Kickstarterͷσʔλऩू/ղੳ Dataset and Pledging Behavior 7
σʔληοτ • Kickstarter͔ΒΫϩʔϧ • 20137݄͔Β10݄ʹొ͞Εͨͷ • USA෦ͷϑΝϯυͷΈ • ߹ܭ 1,149ϓϩδΣΫτ/
78,460ग़ࢿऀ • Twitter͔ΒΫϩʔϧ • ϓϩδΣΫτʹݴٴ͢ΔtweetͷΈ • ߹ܭ71,315 tweetΛऩू 8 (Average)
ࢿՈͷߏ • ߹ܭ78,460ਓͷग़ࢿऀ ! • 4ճະຬͷࢿΛͨ͠ਓ51% • ؾ·͙Εࢿऀ “Occasional Investors”
• 32ճҎ্ͷࢿΛͨ͠ਓ11% • ৗ࿈ࢿऀ “Frequent Investors” ؾ·͙ΕࢿՈ ৗ࿈ࢿՈ 9
ϓϩδΣΫτͷΧςΰϦʔ͝ͱͷࢿऀͷ༁ • Music, DanceͳͲ୯ൃͷग़ࢿ͕ଟ͍ • Gamesৗ࿈ࢿՈ͕ଟ͍ˠେنͳήʔϜ։ൃͷืूͳͲ 10
ࢿՈͷڍಈʹؔ͢ΔԾઆ • ৗ࿈ࢿՈҎԼͷΑ͏ͳੑ࣭ͷϓϩδΣΫτʹࢿ͍͢͠ • ใ͕සൟʹΞοϓσʔτ͞ΕΔ • ϑΝϯμʔ͕ࢿՈͷ࣭ʹ͑Δ • ࢿͷใु͕ྑ͍ •
ߴ͍ࢿֹۚͷϓϩδΣΫτৗ࿈ࢿՈʹࢿ͞Ε͍͢ • ϩʔΧϧͳϓϩδΣΫτؾ·͙ΕࢿՈʹࢿ͞Ε͍͢ • ૣ͘ࢿΛूΊΔϓϩδΣΫτৗ࿈ࢿՈʹࢿ͞Ε͍͢ • ৗ࿈ࢿՈࣗͷڵຯ͋ΔϓϩδΣΫτʹࢿ͍͢͠ 11
ϓϩδΣΫτͰ͢Δಛྔ • ϓϩδΣΫτͷߋ৽ • ϑΝϯμʔͷίϝϯτ • ใुͷϨϕϧ • ΣϒαΠτͷ༗ແ •
ඪֹۚ ($) • ཧతͳڑͷΒ͖ͭ • ϓϩδΣΫτͷ 12
ͦΕͧΕͷಛྔ͝ͱͷࢿऀͷ༁ ϓϩδΣΫτͷߋ৽ ϑΝϯμʔͷίϝϯτ ใुͷϨϕϧ 13 ಛྔͷ͕૿͑Δ΄Ͳʹৗ࿈ࢿՈͷׂ߹͕૿Ճ͢Δ
ͦΕͧΕͷಛྔ͝ͱͷࢿऀͷ༁ (cont’d) ඪֹۚ ($) ཧతͳڑͷΒ͖ͭ ϓϩδΣΫτͷ 14 ඪֹ͕ۚ૿Ճ͢Δ΄Ͳʹ ؾ·͙ΕࢿՈͷׂ߹͕ݮগ͢Δ ؾ·͙ΕࢿՈͷࢿ
ʹӨڹ͞Εͳ͍
ԾઆɿࢿՈͷڵຯͱࢿઌͷؔ • LDA (Latent Dirichlet Allocation) ΛͬͨτϐοΫͷྨࣅ • ࢿͨ͠ϓϩδΣΫτͷ֓ཁͱࢿՈͷTweetͷ༰ (200
tweetsఔ) • (τϐοΫṖ) ! • ৗ࿈ࢿՈࣗͷڵຯ͋ΔෳͷτϐοΫʹࢿ͕ͪ͠ • ؾ·͙ΕࢿՈࣗͷڵຯͱؔͳ͍τϐοΫʹࢿ or ͻͱͭͷτ ϐοΫʹภͬͯࢿ͍ͯ͠Δ 15
͜͜·Ͱͷ·ͱΊ • ৗ࿈ࢿՈ (4ϲ݄ؒʹ32ճҎ্ࢿͨ͠Frequent Investors) • Α͘Ϛωʔδϝϯτ͞Εɼඪֹ͕ۚߴ͘ɼࣗͷڵຯʹ߹͏ϓϩδΣ Ϋτʹࢿ͢Δʹ͋Δ • ௨ৗͷࢿՈͷΑ͏ͳڍಈΛࣔ͢
• ؾ·͙ΕࢿՈ (4ϲ݄ؒʹ4ճະຬͷࢿΛͨ͠Occasional Investors) • ࠓճબΜͩಛྔʹ͋·ΓӨڹ͞ΕͣࢿΛߦ͏ • ࢿͱ͍͏ΑΓدͱ͍͏ײ͡ 16
ืूऀͷ༑ୡ͕ଟ͍΄Ͳ୯ൃͷग़ࢿׂ߹͕૿͑Δ • ϑΝϯμʔͷFacebookͷ༑ୡͷͱ ࢿՈͷ༁ͷؔ • ҼՌؔෆ໌ ! • Facebookͷ༑ୡͷ͕ଟ͍΄Ͳؾ·͙ ΕࢿՈΛूΊ͍͢
• Facebookͷ༑ୡͷ͕গͳ͍΄Ͳৗ࿈ ࢿՈΛूΊ͍͢…??? 17
ϑΝϯμʔͱग़ࢿऀͷϚονϯά Recommending Investors 18
ϓϩδΣΫτͱࢿՈͷϚονϯάํ๏ • Twitterʹ͍ΔજࡏతͳࢿՈʹରͯ͠ϓϩδΣΫτΛਪન͢Δ • KickstarterͷϢʔβ໊͔ΒTwitterͷΞΧϯτΛඥ͚ • 7,429ਓͷࢿՈ͕891ͷϓϩδΣΫτʹࢿͨ͠σʔλΛݩʹਪન • ࢿऀ͕ࢿ͢Δ= 1ɼࢿऀ͕ࢿ͠ͳ͍
= 0ͱͨ͠ೋྨ ! • Ϋϩʔϧͨ͠σʔλਖ਼ྫͷΈͳͷͰɼϥϯμϜͰෛྫΛࠞͥΔ • ਖ਼ྫෛྫͷׂ߹50-50 19
Ϩίϝϯσʔγϣϯͷख๏ͱධՁํ๏ • ੑೳධՁ͢Δख๏4ͭ : {LR, SVM-linear, SVM-poly, SVM-RBF} • ϩδεςΟοΫճؼʢLRʣ
• 3छྨͷΧʔωϧΛ༻͍ͨSVM ʢLinear, polynomial, RBFʣ ! • ධՁɿ5-fold cross validation • σʔληοτͷ80%Ͱֶशˠ20%ͰධՁ Λ5ճ܁Γฦͯ͠ධՁΛฏۉ 20
༻͢Δಛྔ • Static Feature: ϓϩδΣΫτൃ࣌ʹΘ͔Δಛྔ • ඪֹۚɾใुͷϨϕϧɾաڈʹࢿͨ͠ϓϩδΣΫτͷΧςΰϦɾ TwitterͷߘʹΑΔࢿՈͷڵຯ • Dynamic
Feature: ϓϩδΣΫτͷਐߦʹΑͬͯ໌ͯ͘͠Δಛྔ • ϓϩδΣΫτͷޭɾߋ৽ɾίϝϯτɾཧతͳڑͷΒ͖ͭ 21
ϨίϝϯσʔγϣϯͷධՁ • RBFΧʔωϧΛ༻͍ͨSVM • Static͚ͩͷಛྔ ɿ82% • Dynamic͚ͩͷಛྔɿ73% ! •
StaticͱDynamicΛ߹ΘͤΔͱACC 84% ! • ਖ਼ྫෛྫͷׂ߹͕50-50ɿϕʔεϥΠϯ50% ACC : Accuracy P : Precision R : Recall F1 : F-score AUC : ROCۂઢԼͷ໘ੵ 22
Ͳͷಛྔ͕ޮ͍͍ͯΔͷ͔ʁ • C : ίϝϯτ • R : ใुͷϨϕϧ •
S : ཧతͳڑͷΒ͖ͭ • G: • E : ΧςΰϦʔͷҰக • TS: ڵຯ͋ΔτϐοΫͱͷྨࣅ →EͱTS͕ਫ਼্ʹد༩͍ͯ͠Δ 23
จͷ·ͱΊ • ࢿՈʹΑͬͯKickstarterͷࢿελΠϧ͕ҧ͏ • ৗ࿈ࢿՈ৺తͳϓϩδΣΫτʹࢿ͢Δ • ؾ·͙ΕࢿՈدײ֮Ͱࢿ / ܳज़ʹؔ࿈ͨ͠ϓϩδΣΫτʹࢿ •
ࢿՈͱϓϩδΣΫτͷϚονϯάՄೳ • ࢿՈ͕ϓϩδΣΫτʹࢿΛ͢Δ͔Ͳ͏͔84%ͷਫ਼ͰਪଌՄೳ • ࢿՈͷڵຯ͋ΔΧςΰϦʔ༰͕Ϛονϯάʹڧ͘Өڹ͢Δ 24