Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
keisu_special_lecture_20210511.pdf
Search
Taro Takaguchi
May 10, 2021
Technology
0
490
keisu_special_lecture_20210511.pdf
Taro Takaguchi
May 10, 2021
Tweet
Share
More Decks by Taro Takaguchi
See All by Taro Takaguchi
takaguchi_15th_neteco.pdf
ttakaguchi
0
480
KDD2018 ダイジェスト @ Data Ship Update Lecture #6
ttakaguchi
1
110
ウェブ企業の非研究者ポジションで行うサイエンス
ttakaguchi
0
3.6k
Other Decks in Technology
See All in Technology
アセスメントで紐解く、10Xのデータマネジメントの軌跡
10xinc
1
360
古き良き Laravel のシステムは関数型スタイルでリファクタできるのか
leveragestech
1
640
Spring Bootで実装とインフラをこれでもかと分離するための試み
shintanimoto
5
470
開発視点でAWS Signerを考えてみよう!! ~コード署名のその先へ~
masakiokuda
3
140
Would you THINK such a demonstration interesting ?
shumpei3
1
160
似たような課題が何度も蘇ってくるゾンビふりかえりを撲滅するため、ふりかえりのテーマをフォーカスしてもらった話 / focusing on the theme
naitosatoshi
0
390
Vision Pro X Text to 3D Model ~How Swift and Generative Al Unlock a New Era of Spatial Computing~
igaryo0506
0
260
Рекомендации с нуля: как мы в Lamoda превратили главную страницу в ключевую точку входа для персонализированного шоппинга. Данил Комаров, Data Scientist, Lamoda Tech
lamodatech
0
440
GitHub MCP Serverを使って Pull Requestを作る、レビューする
hiyokose
2
710
アジャイル脅威モデリング#1(脅威モデリングナイト#8)
masakane55
3
170
はじめてのSDET / My first challenge as a SDET
bun913
1
200
20250413_湘南kaggler会_音声認識で使うのってメルス・・・なんだっけ?
sugupoko
1
400
Featured
See All Featured
Statistics for Hackers
jakevdp
798
220k
Into the Great Unknown - MozCon
thekraken
37
1.7k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
29
9.4k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
5
520
Faster Mobile Websites
deanohume
306
31k
[RailsConf 2023] Rails as a piece of cake
palkan
54
5.4k
Stop Working from a Prison Cell
hatefulcrawdad
268
20k
Producing Creativity
orderedlist
PRO
344
40k
Intergalactic Javascript Robots from Outer Space
tanoku
270
27k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
23
2.6k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
49k
Measuring & Analyzing Core Web Vitals
bluesmoon
7
390
Transcript
͋Δࣄۀձࣾʹ͓͚Δ σʔλαΠΤϯεͷ࣮ ߴޱ ଠ࿕ LINEגࣜձࣾ Data Science ηϯλʔ ܭֶಛผߨٛ ౦ژେֶ
ֶ෦ ܭֶՊ 2021/05/11 1
ߴޱ ଠ࿕ʢ͔͙ͨͪ ͨΖ͏ʣ LINEגࣜձࣾ Data Science ηϯλʔ γχΞσʔλαΠΤϯςΟετ / Ϛωʔδϟʔ
~2013ɹ౦ژେֶେֶӃ ใཧֶܥݚڀՊ ཧใֶઐ߈ ɹɹɹ ത࢜՝ఔʢཧใୈ̐ݚڀࣨʣ ~2017ɹࠃཱݚڀػؔʹͯϙευΫݚڀһɹ ܦྺ ࣌ͷઐ 2 ωοτϫʔΫՊֶʢಛʹ࣌ؒతʹมԽ͢ΔωοτϫʔΫʣ
اۀʹస͖͔͚ͨͬ͡ 3 2 2 2 1 1 1 3 3
4 4 3 4 ʹཱͪͦ͏ɺͦΕͰ࣮ࣾձͱͷڑԕ͍… @ LINE DEVELOPER DAY 2019 σʔλΛ׆༻ͨ͠ࣄۀͷ࠷લઢΛ ݟ͍ͨɾؔΘΓ͍ͨ
ςʔϚ ʮ͋Δࣄۀձࣾʹॴଐ͢ΔσʔλαΠΤϯςΟετ͕ɺ ͲΜͳࣄͰษڧݚڀͷܦݧΛ׆͔͍ͯ͠Δ͔ʁʯ 4
શମͷߏ 1. ରΛΔɿ ࣄۀձࣾͷσʔλαΠΤϯςΟετͬͯͲΜͳࣄʁ ʢٳܜʣ 2. தΛΔɿ ࣮ͰΑ͘༻͍Δ౷ܭͷҰ෦ͱ۩ମతͳࣄྫ 5
1. ରΛΔɿ ࣄۀձࣾͷσʔλαΠΤϯςΟετͬͯ ͲΜͳࣄʁ 6
ͦͦσʔλαΠΤϯςΟετͱʁ 7 - اۀɾ࣌ظɾίϛϡχςΟʹΑΓఆ༷ٛʑ - ಉ͡৬໊ͰҟͳΔۀɺҟͳΔ৬໊Ͱڞ௨͢Δۀ ࢦඪΛఆٛ͠ܭଌ͢Δ / ετʔϦʔΛޠΔ /
πʔϧΛ࡞Δ Analyticsʢੳܕʣ ػցֶशͷख๏ΛɾαʔϏεʹ࣮͢Δ AlgorithmsʢΞϧΰϦζϜܕʣ ౷ܭख๏ʹΑΓҼՌؔΛཱূ͢Δ Inferenceʢਪܕʣ Ref. https://www.linkedin.com/pulse/one-data-science-job-doesnt-fit-all-elena-grewal/ Data Scientist ྨͷҰྫɿ
λεΫ͝ͱʹׂ͕͔ΕΔ 8 Q1. ɾαʔϏεʹ࣮͞ΕΔίʔυΛॻ͘ʁ Analytics ʢੳܕʣ Algorithms ʢΞϧΰϦζϜܕʣ Inference ʢਪܕʣ
Q2. ౷ܭख๏ʹΑΓҼՌؔΛݕূ͢Δʁ Yes Yes No No ྨͷҰྫɿ
৫ߏ୲ྖҬʹରԠ͍ͯ͠Δ 9 Data Science ηϯλʔ Data Science Machine Learning Machine
Learning Research Analyticsʢੳܕʣ AlgorithmsʢΞϧΰϦζϜܕʣ Inferenceʢਪܕʣ جૅݚڀ͓ΑͼࣄۀͷԠ༻ ػցֶशΤϯδχΞ
ੳɾਪܕͷ۩ମతͳࣄ ਐߦத ࣄલ ࣄޙ ࣌ظ 10 Ωϟϯϖʔϯ / ৽ػೳͷՃ /
طଘػೳͷมߋͳͲ
ੳɾਪܕͷ۩ମతͳࣄ ਐߦத ࣄલ ࣄޙ ࣌ظ 11 Ωϟϯϖʔϯ / ৽ػೳͷՃ /
طଘػೳͷมߋͳͲ - Ωϟϯϖʔϯͷ݅બఆ - ৽ػೳͷχʔζݟੵΓ - ػೳมߋͷӨڹͷݟੵΓ - etc.
ੳɾਪܕͷ۩ମతͳࣄ ਐߦத ࣄલ ࣄޙ ࣌ظ 12 Ωϟϯϖʔϯ / ৽ػೳͷՃ /
طଘػೳͷมߋͳͲ - ΦϯϥΠϯ A/B ςετ - μογϡϘʔυͷ࡞ ओཁͳࣄۀࢦඪͷϞχλϦϯάද - ҟৗͳมԽͷݕग़ - etc.
ੳɾਪܕͷ۩ମతͳࣄ ਐߦத ࣄલ ࣄޙ ࣌ظ 13 Ωϟϯϖʔϯ / ৽ػೳͷՃ /
طଘػೳͷมߋͳͲ - ࢪࡦͷޮՌݕূ - ҼՌਪ - ظతมԽͷཁҼղ - etc.
νʔϜɺϓϩδΣΫτɺϓϩμΫτ 14 νʔϜ
νʔϜɺϓϩδΣΫτɺϓϩμΫτ 15 νʔϜ ϓϩδΣΫτ
νʔϜɺϓϩδΣΫτɺϓϩμΫτ 16 νʔϜ ϓϩδΣΫτ ϓϩμΫτ ྫɿϓϩμΫτʮLINE ΞϓϦʯͷ ◦◦ػೳՃϓϩδΣΫτʹؔΘΔ Data Science
νʔϜ
ʢνʔϜ|ϓϩδΣΫτ|ϓϩμΫτʣϚωδϝϯτ 17 νʔϜ ϓϩδΣΫτ ϓϩμΫτ ৫ͷඪΛઃఆ͠ɺͦͷ࣮ݱͷͨΊʹ ࿑ྗɾ࣌ؒɾ͓ۚͷΛௐ͠ޮԽ͢Δ ※ આ໌ͷͨΊʹ୯७Խ͍ͯ͠·͢
ʮయܕతͳ̍ͷࣄ༰ʁʯ 18 ࣌ظ ϓϩδΣΫτ A ϓϩδΣΫτ B ϓϩδΣΫτ C ͱ͋Δ
1 λεΫ͕ؒΛۭ͚ͯஅଓతʹਐߦ͢Δ e.g. ଞνʔϜͷਐߦͪɺಥൃతͳґཔ
ੳɾਪܕͷλεΫɿ՝ղܾͷαΠΫϧ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 19 ࣌ظ
εςοϓ̍. ؍ଌ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 20 ࣌ظ ࠷ۙɺΞΫςΟϒϢʔβʔ
͕ఀ͍ͯ͠Δʁ 2݄ 3݄ 4݄ 5݄ μογϡϘʔυɿ ओཁͳࣄۀࢦඪͷϞχλϦϯάද ※ ΓऔΓͱͯ͢Սۭͷͷ
εςοϓ̎. Ծઆͱ՝ͷઃఆ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 21 ࣌ظ ͜ͷΞΫςΟϒϢʔβʔͷ
ਪҠରॲ͖͢ͷ͔ʁ - ྫͷقઅతͳมಈʁ - Ϣʔβʔͷηάϝϯτ͝ͱͷมԽʁ - ৽ن / طଘ / ෮ؼ - ଞػೳͷར༻ϢʔβʔͷਪҠʁ → ʮ৽نϢʔβʔͷܧଓ͕Լ ͍ͯ͠Δɻݩͷਫ४ʹճ෮͢Δͱ ˓ສਓ૿ՃͷӨڹ͕͋Δʯ ※ ΓऔΓͱͯ͢Սۭͷͷ
εςοϓ̏. ղܾࡦͷཱҊ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 22 ࣌ظ -
৽نϢʔβʔʹϩάΠϯΛ ଅ͢௨ΛૹΖ͏ - ௨ͷසΛςετ͍ͨ͠ ςετͷઃܭΛ͠·͢ - ൱ΛධՁ͢Δࢦඪͷܾఆ - ςετʹඞཁͳαϯϓϧ αΠζͷܭࢉ - ൱ͷஅج४ͷ߹ҙ ※ ΓऔΓͱͯ͢Սۭͷͷ
εςοϓ̐. ݕূ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 23 ࣌ظ ςετͷ݁ՌΛੳ͠·͢
- σʔλͷਖ਼ৗͳऩूͷ֬ೝ - ࢦඪʹର͢ΔԾઆݕఆ - ՃͷվળҊͷࣔࠦ - ૯߹తͳϨϙʔςΟϯά Ճೖཌʹ̍ճ͚ͩ௨Λ ૹΔҊΛ࠾༻͢Δ ※ ΓऔΓͱͯ͢Սۭͷͷ
εςοϓ̍(2). ؍ଌ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 24 ࣌ظ ৽نϢʔβʔͷܧଓ
ࠓޙϞχλϦϯά͠·͢ ※ ΓऔΓͱͯ͢Սۭͷͷ 2݄ 3݄ 4݄ 5݄ 6݄ 2݄ 3݄ 4݄ 5݄ 6݄ μογϡϘʔυʹ߲ΛՃ͢Δ ΞΫςΟϒϢʔβʔ ৽نϢʔβʔܧଓ
ੳɾਪܕͷλεΫɿ՝ղܾͷαΠΫϧ ؍ଌ Ծઆͱ՝ ͷઃఆ ݕূ ղܾࡦͷཱҊ 25 ࣌ظ - ੳɾਪͷλεΫ
ؔऀͱͷίϛϡχέʔγϣϯΛ ௨ͯ͡ਐߦ͢Δ - ౷ܭͳͲઐࣝͷ׆༻ɺ શମͷαΠΫϧͷதͷҰཁૉ - ࠷ऴతͳҙࢥܾఆऀɺࣄۀɾ ϓϩμΫτɾϓϩδΣΫτͷऀ
ઐతͳֶͷࣝ͏ʁ 26 “LIFE AND MATHS”, © Pearls of Raw Nerdism
http://pearlsofrawnerdism.com/life-and-maths/
ઐతͳֶͷࣝ͏ʁ 27 “LIFE AND MATHS”, © Pearls of Raw Nerdism
http://pearlsofrawnerdism.com/life-and-maths/ ࢲͷߟ͑ɿ - ઐతͳֶͳ͠ͰࡁΉػձͷ΄͏͕ଟ͍ - ઐ͕ࣝ͋Δͱɺ՝ղܾͷ֤εςοϓͷ্࣭͕͕Δ
ઐతͳֶͳ͠ͰࡁΉػձͷ΄͏͕ଟ͍ ൃੜස ֶతͳ ෳࡶ 28 ߴ ߴ ֓೦ਤ
ੳɾਪܕͷׂ ˚ෳࡶͳ͜ͱΛߦ͢Δ͜ͱ ˚ཧతʹ৽نͳ͜ͱΛߦ͏͜ͱ ✓ ࣄۀʹཱͭݟΛదʹఏڙ͢Δ͜ͱ ʮࣄۀʹର͢Δߩݙʯ ʮ࣮ࢪʹཁ͢Δίετʯͷ͕࣠ӅΕ͍ͯΔ ֶతͳ͠͞ ≠ ࣄۀ্ͷ՝ղܾͷ͠͞
ฏқͳ࡞ۀɺઐతͳۀͷ ྫɿ୯७ͳूܭ࡞ۀ ઐࣝΛ ൃش͢Δۀ 29 ฏқͳ࡞ۀΛ௨ͨ͡σʔλɾࣄۀͷशख़ → ઐࣝΛൃش͢Δۀͷߦ ࣄۀʹର͢Δཧղͷ্ →
࣮ࢪʹίετͷ͔͔ΔੳλεΫͷड
ઐ͕ࣝ͋Δͱɺ՝ղܾͷ֤εςοϓͷ্࣭͕͕Δ 30 ؍ଌ Ծઆͱ՝ͷઃఆ ݕূ ղܾࡦͷཱҊ ΑΓϊΠζʹؤ݈Ͱղऍͷ͍͢͠ࢦඪΛ ఆٛͰ͖Δ ʮσʔλͱֶʹΑͬͯղ͚Δʯͷ ఆࣜԽͷϨύʔτϦʔ͕૿͑Δ
ద͔ͭޮతͳղܾࡦΛબͰ͖Δ ҙࢥܾఆʹඞཁͳݟΛత֬ʹநग़Ͱ͖Δ
֓೦ͷ֫ಘੈքͷݟ͑ํΛม͑Δ 31 ՝ɿ̎Λ̍ສݸͨ͑͠ΛΓ͍ͨ ࢉͷ֓೦ΛΒͳ͍ͱ 2 + 2 + 2 +
2 + …… ʮݱ࣮తͳ࣌ؒͰղܾͰ͖·ͤΜʯ ࢉΛ͍ͬͯΕ 2 × 10,000 = 20,000 ղ͚ͳ͍ ղ͚Δ
ʮ͑Λग़͢ͱࣄۀʹཱͭʯྖҬΛࢦ͢ 32 ࣄۀՁʹ ݁ͼͭ͘ ࣄۀՁʹ ݁ͼ͖ͭͮΒ͍ ͑Λग़ͤΔ ͑Λग़ͤͳ͍ ઐࣝͷशಘ ࣄۀͷཧղ
ؔऀͱͷର Cf. ҆ਓ, ʮΠγϡʔ͔Β͡ΊΑʕతੜ࢈ͷʰγϯϓϧͳຊ࣭ʱʯ, ӳ࣏ग़൛ʢ2010ʣ σʔλαΠΤϯςΟετͷۀ্ͷλεΫΛ̎࣍ݩʹϚοϓ͢Δ
ࣄۀձࣾͷσʔλαΠΤϯςΟετͷࣄ 33 ࣄۀͷͨΊͷ՝ղܾͷαΠΫϧ ੳܕ / ΞϧΰϦζϜܕ / ਪܕ νʔϜͱͯ͠ϓϩδΣΫτʹऔΓΉ ྨʢҰྫʣ
Ґஔ͚ͮ ੳɾਪͷλεΫ ઐతͳࣝ ՝ղܾͷ࣭Λ্͛Δ
ͲΜͳڥͩͱྗΛൃش͍͔͢͠ʁ 34 A. αΠΤϯε͕Ͱ͖Δ͜ͱ Պֶతํ๏ʹج͍ͮͯۀΛߦ͠ɺՌ͕ೝ͞ΕΔ͜ͱ - ٬؍తͳࠜڌʹج͍ͮͯɺཧΛల։͢Δ͜ͱ - खଓ͖͕ه͞Εɺ࠶ݱՄೳͰ͋Δ͜ͱ -
ͱ͘ʹ͕݁ޡΓͩͬͨ߹ʹɺݕূՄೳͰ͋Δ͜ͱ
ۀΛαΠΤϯεʹ͢ΔͨΊʹ 1. ܧଓ͢Δ 2. ԾఆΛڞ༗͢Δ 3. ࣈΛݟΔલʹஅج४ΛܾΊΔ 35 σʔλαΠΤϯςΟετଆʹ৺͕͚Δ͖͜ͱ͕͋Δ
1. ܧଓ͢Δ 36 Պֶతํ๏ɺ܁Γฦ͢͜ͱʹҙ͕ٛ͋Δ ԿΛ͖͔͢ʁ - ࠶ݱɾݕূՄೳͳΑ͏ʹهΛ͢ - ҡ࣋Մೳͳ؍ଌํ๏Λߏங͢Δ ʢϞχλϦϯάͷࣗಈԽʣ
- ࣍ͷ՝ઃఆΛଅࣔࠦ͢Λఏڙ͢Δ
2. ԾఆΛڞ༗͢Δ 37 ܦݧՊֶʹ͓͚ΔՊֶతࣝ ✗ ઈରෆมͷਅ࣮ͷू߹ ✓ ؍ଌͱԾఆʹج͍ͮͯਪ͞Εͨؼ݁ ԿΛ͖͔͢ʁ -
ԾఆΛ໌֬ʹ͑Δ ʮϢʔβʔͷ૿Ճઌ݄ͱಉ͡ͱԾఆ͠·͢ʯ - ݕূͷεςοϓͰɺࣄલͷԾఆͷଥੑݕূ͢Δ ʮϢʔβʔͷ૿Ճɺ݁Ռతʹઌ݄ͱൺͯʙͰͨ͠ʯ
3. ࣈΛݟΔલʹஅج४ΛܾΊΔ 38 ྔ → ৗݴޠͷมʹᐆດੑ͕͋Δ ͜ͷࢦඪ͕ “ेʹ” ্ঢͨ͠Β ςετޭͱஅ͠·͠ΐ͏
ʢ+3% ”े” ͩΖ͏͔…ʣ ԿΛ͖͔͢ʁ - ࣄલʹஅج४ΛྔతʹܾΊΔ - ج४ͷࠜڌ٬؍తʹ͢Δ (ྫ) ࣄۀඪʹର͢Δظد༩ ɹɹ͡ΒΕͨίετͷճऩ ɹɹաڈͷྨࣅࣄྫͷ݁Ռ ※ ͯ͢Սۭͷͷ ࢦඪͷ্ঢ +3% Ͱͨ͠
ٳܜ 39
શମͷߏ 1. ରΛΔɿ ࣄۀձࣾͷσʔλαΠΤϯςΟετͬͯͲΜͳࣄʁ ʢٳܜʣ 2. தΛΔɿ ࣮ͰΑ͘༻͍Δ౷ܭͷҰ෦ͱ۩ମతͳࣄྫ 40
2. தΛΔɿ ࣮ͰΑ͘༻͍Δ౷ܭͷҰ෦ͱ۩ମతͳࣄྫ 41
ੳɾਪͷ۩ମతͳࣄʢ࠶ܝʣ 42 ਐߦத ࣄલ ࣄޙ ࣌ظ Ωϟϯϖʔϯ / ৽ػೳͷՃ /
طଘػೳͷมߋͳͲ ΦϯϥΠϯ A/B ςετ 1. αϯϓϧαΠζͷܭࢉ 2. ଟॏൺֱ 3. ׳ΕޮՌͷਪఆ
1. αϯϓϧαΠζͷܭࢉ 43 ςετରͷࠩΛݕఆ͢ΔͨΊʹඞཁͳαϯϓϧαΠζΛࢉग़͢Δ͜ͱ Q. αϯϓϧαΠζΛܭࢉܾͯ͠ΊΔཧ༝ʁ A. ౷తͳԠ༻Ͱɺαϯϓϧऩूͷίετ͕ߴ͔ͬͨ ɹe.g. ྟচݚڀɺۀ
Q. ΣϒαʔϏεͳΒαϯϓϧऩूͷίετߴ͘ͳ͍ͷͰʁ
ΣϒαʔϏεͰαϯϓϧαΠζΛܭࢉ͢Δཧ༝ 44 1. աʹେ͖ͳαϯϓϧαΠζ → খ͞ͳมԽͰ༗ҙʹͳΓ͕ͪ ʮ౷ܭతʹ༗ҙʯڧ͍ҹΛ༩͑Δදݱ 2. ಛʹςετҊ͕ྑ͘ͳ͍࣌ɺϢʔβʔʹແ༻ͳӨڹΛ༩͑ͯ͠·͏ 4.
P-Hacking ͷ༨͕Δ ʮ༗ҙ͕ࠩग़ͳ͔͔ͬͨΒαϯϓϧαΠζΛେ͖ͯ͘͠࠶ςετ͠Α͏ʯ 3. SUTVA (Stable Unit Treatment Value Assumption) ͕ഁΕ͘͢ͳΔ ʮ͋ΔϢʔβʔͷߦಈଞͷϢʔβʔͷׂΓͯʹӨڹ͞Εͳ͍ʯ ʢྫʣςετը໘͕ڞ༗͞ΕΔɺϝσΟΞʹऔΓ্͛ΒΕΔ
αϯϓϧαΠζܭࢉͷجຊܗ 45 ઃఆ - ಠཱͳ̎܈αϯϓϧͷฏۉͷݕఆ - ࢄ̎܈Ͱಉ͡ & ط -
αϯϓϧαΠζઍ ~ ສ݅ఔऔΕΔ ݕఆ͞ΕΔԾઆ - ؼແԾઆ - ରཱԾઆ H0 H1 μ1 − μ2 = 0 μ1 − μ2 ≠ 0 αϯϓϧαΠζɹͷܾఆʹඞཁͳύϥϝʔλ - ༗ҙਫ४ - ݕग़ྗ - ޮՌྔ - ࢄ α 1 − β δ = μ1 − μ2 σ2 < + ∞ n ʢɹ ͕ਅͷ߹ʣ H1
αϯϓϧαΠζܭࢉͷ෮शʢ̍ʣ 46 ਤԼهจݙΑΓ࠶ߏͨ͠ Gerald van Belle, “Statistical Rules of Thumb”
(2nd edition), Wiley, 2008 ඪຊฏۉͷࠩ x1 − x2 H0 : μ1 − μ2 = 0 0 S . E . = σ 2 n ਖ਼نͷ࠶ੜੑΑΓ α 2 α 2 ༗ҙਫ४ɹɿ α ɹ͕ਅͷͱ͖ɹ Λ࠾ͯ͠͠·͏֬ ʢِཅੑʣ H0 H1
αϯϓϧαΠζܭࢉͷ෮शʢ̎ʣ 47 ਤԼهจݙΑΓ࠶ߏͨ͠ Gerald van Belle, “Statistical Rules of Thumb”
(2nd edition), Wiley, 2008 ඪຊฏۉͷࠩ x1 − x2 H1 : μ1 − μ2 = δ δ S . E . = σ 2 n H0 : μ1 − μ2 = 0 0 β = 1− ݕग़ྗ (1 − β) ɹ͕ਅͷͱ͖ɹ Λ࠾ͯ͠ ͠·͏֬ʢِӄੑʣ H0 H1 β
αϯϓϧαΠζܭࢉͷ෮शʢ̏ʣ 48 ਤԼهจݙΑΓ࠶ߏͨ͠ Gerald van Belle, “Statistical Rules of Thumb”
(2nd edition), Wiley, 2008 ඪຊฏۉͷࠩ x1 − x2 δ S . E . = σ 2 n 0 β n* = 2σ2 (z1−α/2 + z1−β) 2 δ2 㱺 ཁ݅Λຬͨͨ͢Ίʹ ࠷ݶඞཁͳαϯϓϧαΠζ z1−α/2 σ 2 n* = δ − z1−β σ 2 n* ඪ४ਖ਼نͷ Ґؔ α 2
1. ਅͷͷࢄɹ ͕େ 2. ِཅੑɺِӄੑΛ͑Δ ɹɹ͕খ 3. ݕग़͍ͨ͠ޮՌྔɹ͕খ ͕େ͖͘ͳΔཁҼ n*
σ2 α, β δ αϯϓϧαΠζʹ͍ͭͯͷิ 49 n* = 2σ2 (z1−α/2 + z1−β) 2 δ2 ཁ݅Λຬͨͨ͢Ίʹ ࠷ݶඞཁͳαϯϓϧαΠζ ύϥϝʔλͷܾΊํʢҰྫʣ ɹɹɿ׳शతͳ͔ɺݫ͠ʹ ɹɹɿۙͷ࣮ଌ ɹɹʢςετޙʹଥੑΛ֬ೝʣ ɹɹɿ׳शతͳ ɹɹɹor ίετΛ্ճΔޮՌ ɹɹɹor աڈͷྨࣅ͢Δςετ݁Ռ α, β σ2 δ
2. ଟॏൺֱ 50 ࣮Ͱɺ̏܈Ҏ্ͷൺֱΛٻΊΒΕΔ͜ͱ͕Α͋͘Δ 㲗 લઅͰ෮शͨ̎͠܈ؒͷݕఆ എܠ ظؒͰͳΔ͘ଟ͘ͷՄೳੑΛςετ͍ͨ͠ ܈ؒϖΞͷճ͚ͩ୯७ʹݕఆΛ܁Γฦ͍͚ͯ͠ͳ͍ ʂʂ
ଟॏൺֱ ͷ →
࣮ྫɿάϧʔϓ࡞ը໘ͷมߋςετ 51 “ίϛϡχέʔγϣϯΞϓϦʮLINEʯͷػೳվળΛࢧ͑ΔσʔλαΠΤϯε” LINE DEVELOPER DAY 2019 https://linedevday.linecorp.com/jp/2019/sessions/B1-3 άϧʔϓ࡞ͷखॱΛɺΑΓ͍͘͢վྑ͍ͨ͠
̎ͭͷมߋΛΈ߹Θͤͯࢼ͍ͨ͠ 52 “ίϛϡχέʔγϣϯΞϓϦʮLINEʯͷػೳվળΛࢧ͑ΔσʔλαΠΤϯε” LINE DEVELOPER DAY 2019 https://linedevday.linecorp.com/jp/2019/sessions/B1-3 1. ࠷ۙτʔΫͨ͠༑ͩͪΛ༏ઌදࣔ͢Δ
2. खॱΛ̍ը໘ʹ·ͱΊΔ
̎×̎=̐௨Γ̒ϖΞͷݕఆʁ 53 “ίϛϡχέʔγϣϯΞϓϦʮLINEʯͷػೳվળΛࢧ͑ΔσʔλαΠΤϯε” LINE DEVELOPER DAY 2019 https://linedevday.linecorp.com/jp/2019/sessions/B1-3 গͷީิͰɺৄࡉͳݕূҙ֎ͳ΄ͲෳࡶʹͳΔ
ݕఆͷ܁Γฦ͠Կ͕͔ʁ 54 ݕఆ͞ΕΔԾઆ - ؼແԾઆ - ରཱԾઆ H0 H1 θ1
= θ2 = θ3 = θ4 ʢ̐܈ͷ߹ʣ {θi} i=1,2,3,4 ͷ͏ͪগͳ͘ͱ̍ͭͷϖΞͰ θi ≠ θj (i ≠ j) ࣮ߦతͳ༗ҙਫ४ Family-Wise Error Rate α = 1 − (1 − α)6 ≥ α α α 1 − (1 − α)6 શମͱͯ͠ݟͨ࣌ʹɺِཅੑ্͕͕ͬͯ͠·͏
Bonferroni ิਖ਼ 55 ֤ϖΞͷݕఆͷ༗ҙਫ४ΛɺݕఆͷճɹͰׂͬͨʹௐ͢Δ α → α m m Family-Wise
Error Rate α ≤ α ͱͳΓɺશମͱͯ͠ͷ༗ҙਫ४͕อͨΕΔ σϝϦοτ ͕େ͖͍ͱอकతʹͳΓ͕ͪʢِӄੑͷ্ঢʣ m
άϧʔϓ࡞ը໘ςετͰͷରॲ 56 “ίϛϡχέʔγϣϯΞϓϦʮLINEʯͷػೳվળΛࢧ͑ΔσʔλαΠΤϯε” LINE DEVELOPER DAY 2019 https://linedevday.linecorp.com/jp/2019/sessions/B1-3 - ࣄલݕূʹج͍ͮͯɺରΛ̏܈̎ϖΞʹߜΔ
- ̎ϖΞʹରͯ͠ Bonferroni ิਖ਼ͯ͠ݕఆ͢Δ α → α/2
άϧʔϓ࡞ը໘ςετͷ݁Ռ 57 “ίϛϡχέʔγϣϯΞϓϦʮLINEʯͷػೳվળΛࢧ͑ΔσʔλαΠΤϯε” LINE DEVELOPER DAY 2019 https://linedevday.linecorp.com/jp/2019/sessions/B1-3 ʮ̎ը໘ +
࠷ۙτʔΫͨ͠༑ͩͪϦετʯ → ࡞ྃΛҡ࣋ͭͭ͠ɺ࡞ͷॴཁ࣌ؒΛॖͨ͠
ଟॏൺֱͷରॲʹऔΓೖΕ͍ͯΔ͜ͱ 58 - جຊతʹൺֱରΛߴʑ̐ύλʔϯʹݶఆ͢Δ ̑ύλʔϯҎ্ੳղऍ͘͠ͳΔ - Bonferroni ิਖ਼ͰِཅੑΛ੍͢Δ ϦεΫͷ͋ΔςετͰِཅੑΛආ͚͍ͨ -
σϝϦοτΛิ͏ͨΊɺݕग़ྗɹΛߴΊʹઃఆ͢Δ β
3. ׳ΕޮՌͷਪఆ 59 ΦϯϥΠϯςετͷظؒ௨ৗ̎ʙ̏िؒ ظؒͷԠΛͦͷ··ड͚औͬͯΑ͍ͷͩΖ͏͔ʁ Ծઆ ಛʹྺ࢙͕͘श׳Խ͍ͯ͠Δػೳ΄Ͳɺ ը໘ͷมߋʹର͢ΔҰ࣌తͳԠ͕ݱΕΔ ՝ Ұ࣌తͳԠ͕ఆৗతͳར༻ʹམͪண͘
ʮ׳ΕޮՌʯΛݕग़͍ͨ͠
࣮ྫɿ༑ͩͪՃը໘ͷγϯϓϧԽςετ 60 “ίϛϡχέʔγϣϯΞϓϦʮLINEʯʹ͓͚Δ࣮ફతσʔλαΠΤϯε” DEIM 2020 https://engineering.linecorp.com/ja/blog/deim2020-report/ - ༑ͩͪՃը໘͔Βɺ༑ͩͪՃҎ֎ͷΞΠςϜΛআ͢Δ - ༑ͩͪՃ
& LINEެࣜΞΧϯτՃ͕ݮগͯ͠͠·ͬͨ
LINE ͷ৽نϢʔβʔͷΈʹݶఆͯ͠ܭଌͯ͠ΈΔ 61 - ༑ͩͪՃɾLINEެࣜΞΧϯτՃͱʹ༗ҙͳมԽͳ͠ - ԾઆɿશମʹطଘϢʔβʔ͕׳ΕΔ·ͰͷԠ͕ݱΕ͍ͯΔʁ “ίϛϡχέʔγϣϯΞϓϦʮLINEʯʹ͓͚Δ࣮ફతσʔλαΠΤϯε” DEIM 2020
https://engineering.linecorp.com/ja/blog/deim2020-report/
׳ΕޮՌΛࠩͷࠩͰϞσϧԽ͢Δ 62 1st half 2nd half Control Treatment yT,1 yC,1
yC,2 yT,2 ׳ΕޮՌҎ֎ͷӨڹ̎܈ͰಉҰ ʢฒߦτϨϯυ & ڞ௨γϣοΫͷԾఆʣ Ծఆ ςετظؒΛલɾޙʹ̎͢Δ ࠩͷࠩ౷ܭྔ δ = (yT,2 − yC,2) − (yT,1 − yC,1) ճؼϞσϧԽ ̂ β3 = ̂ δ y = β0 + β1 T + β2 S + β3 TS + ε T / C ͷμϛʔ 1st / 2nd ͷμϛʔ Ͱ͋Γɺ ճؼϞσϧͷͯ·Γ & ͷ༗ҙੑΛ֬ೝ͢Δ
༑ͩͪՃը໘ͷγϯϓϧԽςετͷ݁Ռ 63 - LINEެࣜΞΧϯτՃͷมԽʹɺ׳ΕޮՌ͕ݱΕ͍ͯͨ - LINEެࣜΞΧϯτՃͷͷΈআͯ͠ɺϦϦʔε͞Εͨ “ίϛϡχέʔγϣϯΞϓϦʮLINEʯʹ͓͚Δ࣮ફతσʔλαΠΤϯε” DEIM 2020 https://engineering.linecorp.com/ja/blog/deim2020-report/
ख๏Λඪ४Խͯ͠ਫฏల։͢Δ 64 “σʔλαΠΤϯε͕ಋ͘τʔΫϝχϡʔUIͷϦχϡʔΞϧϓϩδΣΫτ” LINE DEVELOPER DAY 2020 https://linedevday.linecorp.com/2020/ja/sessions/3932 - ׳ΕޮՌݕग़๏ɺτʔΫϝχϡʔͷϦχϡʔΞϧͰ׆༻ͨ͠
- ۀΛαΠΤϯεʹ͢ΔͨΊʹ - ܧଓ͢Δ
࣮ͰΑ͘༻͍Δ౷ܭͷҰ෦ͱ۩ମతͳࣄྫ 65 ΦϯϥΠϯ A/B ςετ 1. αϯϓϧαΠζͷܭࢉ 2. ଟॏൺֱ 3.
׳ΕޮՌͷਪఆ
͓ΘΓʹ 66
ʮσʔλαΠΤϯςΟετʯͷকདྷʁ 67 ࢲͷߟ͑ɿ - ʮσʔλαΠΤϯςΟετʯͱ͍͏ݺশظมΘ͍͔ͬͯ͘ - σʔλͱֶΛͬͯ՝Λղܾ͢Δͱ͍͏ཁ໘ന͞ ˠ ݺশͷਰΑΓͣͬͱ͘ଓͩ͘Ζ͏
ੈք͔ΒࣝΛநग़͢ΔαΠΫϧ 68 ੈք σʔλ ࣝ ॲཧ݁Ռ ࣝΛͲ͏ੈքʹϑΟʔυόοΫ͢Δ͔ʁ ੈքʹͲ͏͋ͬͯ΄͍͔͠ͱ͍͏Ձஅ ԿΛɺԿͷͨΊʹͲ͏؍ଌ͢Δ͔ʁ ܭଌʹ͢Δ͔൱͔ͱ͍͏Ձஅ
ରͷੈքΛݶఆ͢ΕࣗಈԽͷՄೳੑ͕͋Δ 69 ੈք σʔλ ࣝ ॲཧ݁Ռ ͷΓग़͠ͱγεςϜԽɺιϑτΣΞΤϯδχΞϦϯάͷྖҬ ʢྫʣهࣄͷਪનɺϚʔέςΟϯά ՁஅͷॏཁੑΓଓ͚Δ
ֶੜͷօ͞Μͷϝοηʔδ 70 ֶͼଓ͚·͠ΐ͏ औΓΈ·͠ΐ͏ - ֶ෦ɾେֶӃͰͷݚڀʢ՝ղܾͷαΠΫϧʣ ɹେֶੈքϨϕϧͷઐՈ͔ΒֶΔوॏͳॴ - ֶɾֶͷઐࣝ -
ϓϩάϥϛϯά - ޠֶ - ٕज़ྙཧɺ๏੍ɺྺ࢙ɺ… ৬໊τϐοΫͷྲྀߦʹͱΒΘΕ͗ͣ͢ɺ ઐࣝͰ՝ղܾͰ͖ΔਓΛͥͻࢦ͍ͯͩ͘͠͞