Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
P 値と有意差/分散分析 / P-value, Significant Difference ...
Search
Kenji Saito
PRO
January 03, 2025
Technology
0
65
P 値と有意差/分散分析 / P-value, Significant Difference and Analysis of Variance
早稲田大学大学院経営管理研究科「企業データ分析」2024 冬の第9-10回で使用したスライドです。
Kenji Saito
PRO
January 03, 2025
Tweet
Share
More Decks by Kenji Saito
See All by Kenji Saito
続・インクルーシブな社会へ / Continuing Towards an Inclusive Society
ks91
PRO
0
11
AGI (人工一般知能) と創る新しく奇妙な社会 / New and Stranger Society built with AGI
ks91
PRO
0
55
回帰分析/大規模言語モデルと統計 / Regression Analysis, Large Language Models and Statistics
ks91
PRO
0
61
多重比較/相関分析 / Multiple Comparison and Correlation Analysis
ks91
PRO
0
61
アカデミーキャンプ 2025冬「考えるのは奴らだ」 / Academy Camp 2025 Winter - Live and Let Think DAY 3
ks91
PRO
0
56
アカデミーキャンプ 2025冬「考えるのは奴らだ」 / Academy Camp 2025 Winter - Live and Let Think DAY 2
ks91
PRO
0
43
アカデミーキャンプ 2025冬「考えるのは奴らだ」 / Academy Camp 2025 Winter - Live and Let Think DAY 1
ks91
PRO
1
68
インクルーシブな社会へ / Toward an Inclusive Society
ks91
PRO
0
14
関連2群のt検定/独立2群のt検定 / Related 2-group t-test and independent 2-group t-test
ks91
PRO
0
73
Other Decks in Technology
See All in Technology
2024.02.19 W&B AIエージェントLT会 / AIエージェントが業務を代行するための計画と実行 / Algomatic 宮脇
smiyawaki0820
10
1.5k
ハッキングの世界に迫る~攻撃者の思考で考えるセキュリティ~
nomizone
13
5k
利用終了したドメイン名の最強終活〜観測環境を育てて、分析・供養している件〜 / The Ultimate End-of-Life Preparation for Discontinued Domain Names
nttcom
1
120
関東Kaggler会LT: 人狼コンペとLLM量子化について
nejumi
3
540
あれは良かった、あれは苦労したB2B2C型SaaSの新規開発におけるCloud Spanner
hirohito1108
2
370
7日間でハッキングをはじめる本をはじめてみませんか?_ITエンジニア本大賞2025
nomizone
2
1.7k
Helm , Kustomize に代わる !? 次世代 k8s パッケージマネージャー Glasskube 入門 / glasskube-entry
parupappa2929
0
210
開発組織のための セキュアコーディング研修の始め方
flatt_security
3
1.4k
AndroidXR 開発ツールごとの できることできないこと
donabe3
0
120
「海外登壇」という 選択肢を与えるために 〜Gophers EX
logica0419
0
640
現場の種を事業の芽にする - エンジニア主導のイノベーションを事業戦略に装着する方法 -
kzkmaeda
2
1.8k
人はなぜISUCONに夢中になるのか
kakehashi
PRO
6
1.5k
Featured
See All Featured
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
193
16k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
27
1.9k
How to train your dragon (web standard)
notwaldorf
91
5.8k
Designing on Purpose - Digital PM Summit 2013
jponch
117
7.1k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
99
18k
Being A Developer After 40
akosma
89
590k
Designing for Performance
lara
604
68k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
53k
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
7
630
A Philosophy of Restraint
colly
203
16k
What's in a price? How to price your products and services
michaelherold
244
12k
Java REST API Framework Comparison - PWX 2021
mraible
28
8.4k
Transcript
Corporate data analysis — generated by Stable Diffusion XL v1.0
2024 9-10 P (WBS) 2024 9-10 P — 2025-01-06 – p.1/33
https://speakerdeck.com/ks91/collections/corporate-data-analysis-2024-winter 2024 9-10 P — 2025-01-06 – p.2/33
( ) 1 12 2 • 2 12 2 (B
A ) • 3 12 9 • 4 12 9 • 5 12 16 • 6 12 16 t • 7 12 23 2 ( ) t • 8 12 23 2 ( ) t • 9 1 6 P • 10 1 6 • 11 1 20 12 1 20 13 1 27 14 1 27 W-IOI 2024 9-10 P — 2025-01-06 – p.3/33
( 20 25 ) 1 (20 ) • 2 R
( 55 ) • 3 (32 ) • 4 (14 ) • 5 ( Git) (22 ) • 6 ( ) (24 ) • 7 (1) (25 ) • 8 (2) (25 ) • 9 R ( ) (1) — Welch (17 ) • 10 R ( ) (2) — (21 ) • 11 R ( ) (1) — (15 ) • 12 R ( ) (2) — (19 ) • 13 GPT-4 (19 ) • 14 GPT-4 (29 ) • 15 ( ) LaTeX Overleaf (40 ) • 8 (12/16 ) / (2 ) OK / 2024 9-10 P — 2025-01-06 – p.4/33
( Student µ 95% ) 7 2 t ( t
) 2 ( ) 2 d ( ) ← [ 3] σd 2 t 8 2 t ( t ) 2 ( ) ( ) ← [ 4] σ 2 t 2024 9-10 P — 2025-01-06 – p.5/33
2 2 t 1 9 P P 10 H0 HA
k, N, ¯ ¯ x σ2 ( )MSwithin ( )MSbetween MStotal F F 2024 9-10 P — 2025-01-06 – p.6/33
2024 9-10 P — 2025-01-06 – p.7/33
4. t (1) 2 t (2) 2 t (3) 2025
1 2 ( ) 23:59 JST ( ) Waseda Moodle (Q & A ) (1)(2) Discord 2024 9-10 P — 2025-01-06 – p.8/33
. . . . . . 17 14 (1/3( )
) ( ) → 14 ( ) ( ) → 6 → 3 ( ) → 5 ( ) ( OK) 2 t . . . . . . / . . . ( ) 2024 9-10 P — 2025-01-06 – p.9/33
t t ⇒ ( ) A A xA 2 B
B xB 2 df . . . ⇒ t σ z0.05 . . . ⇒ ( ) t 2024 9-10 P — 2025-01-06 – p.10/33
N (1/2) 2 t 2 2 “ ” 1. 1
2 2. 3. - 2 (n − 1) 4. ÷ ÷ t 5. t (n − 1) t t ⇒ . . . 0 ( ) 2024 9-10 P — 2025-01-06 – p.11/33
N (2/2) 2 t 2 2 1 2 1 2
2 “ ” 1. 2 1 2 2. 3. ( -2) 4. t 1÷ 2 t 5. t (n1 + n2 − 2) t t ⇒ 2024 9-10 P — 2025-01-06 – p.12/33
M ( ) [ 2 t ] 1Day 1Day 1Day
⇒ 2024 9-10 P — 2025-01-06 – p.13/33
K ⇒ . . . 2024 9-10 P — 2025-01-06
– p.14/33
2 t d : µd 0 ( 2 ) :
(1) d d, sd , n, df (2) |d| sd n |t| (3) t0.05 (df) < |t| ( ) R > t.test(sample2, sample1, paired=T) 2024 9-10 P — 2025-01-06 – p.15/33
2 t ( ) 10 ( ) ( ) (
) ( ) ( ) d ( ) d, ( ) sd , ( ) n, ( ) df ( ) t ( ) t ( ) d ( ) sd ( ) n ( ) t df 5% ( ) ( ) ( ) ( ) ( ) 2024 9-10 P — 2025-01-06 – p.16/33
2 t xA xB : µA − µB 0 (
2 ) : (1) xA − xB , sp , nA nB , df (2) |xA − xB | sp nA nB |t| (3) t0.05 (df) < |t| ( ) R > t.test(sample2, sample1, var.equal=T) 2024 9-10 P — 2025-01-06 – p.17/33
2 t ( ) ( ) ( ) ( )
( ) ( ) ( ( ) A B ( ) ) ( ) xA − xB , A B ( ) ( ) sp , ( ) nA nB , ( ) df = nA + nB − 2 ( ) t ( ) t ( ) xA − xB ( ) sp ( ) nA ,nB ( ) t df 5% ( ) ( ) ( ) ( ) ( ) ( ) 2024 9-10 P — 2025-01-06 – p.18/33
K 2 t ( ) ⇒ 2 2024 9-10 P
— 2025-01-06 – p.19/33
N ⇒ (σ) ( σ √n ) ( ) p.121
(standard error) (p.121) (sampling distribution) (p.120) (p.120) ( : ) 2024 9-10 P — 2025-01-06 – p.20/33
K ⇒ . . . AI ( ) . .
. ^^; ( ) 2024 9-10 P — 2025-01-06 – p.21/33
H t 2 Student t t 1 sin(α + β)
= sinαcosβ + cosαsinβ . . . ⇒ 2024 9-10 P — 2025-01-06 – p.22/33
U R ChatGPT ⇒ AI ( ) 2024 9-10 P
— 2025-01-06 – p.23/33
9 P P 2024 9-10 P — 2025-01-06 – p.24/33
α β P P H0 ( ) P 0.05 (P
= 0.015) (P = 0.361) 2024 9-10 P — 2025-01-06 – p.25/33
10 H0 HA k, N, ¯ ¯ x σ2 (
) MSwithin ( )MSbetween MStotal ( SStotal dftotal ) F F 2024 9-10 P — 2025-01-06 – p.26/33
(1/3) k (1) : (2) : σ2 ( ) N(µ,
σ2) µ1 = µ2 = · · · = µk N ( ) ¯ ¯ x ¯ ¯ x = k j=1 nj i=1 xji N (j i N ) 2024 9-10 P — 2025-01-06 – p.27/33
(2/3) ( )MSwithin σ2 MSwithin = SSwithin dfwithin = k
j=1 nj i=1 (xji − ¯ xj )2 N − k ( N− ) ( )MSbetween σ2 MSbetween = SSbetween dfbetween = k j=1 nj (¯ xj − ¯ ¯ x)2 k − 1 ( −1 ) ( H0 σ2 ) 2024 9-10 P — 2025-01-06 – p.28/33
(3/3) MStotal MStotal = SStotal dftotal = k j=1 nj
i=1 (xji − ¯ ¯ x)2 N − 1 ( N − 1 ) : SStotal = SSbetween + SSwithin, dftotal = dfbetween + dfwithin F F = MSbetween MSwithin F0.05 (dfbetween, dfwithin ) < F ( H0 ) 2024 9-10 P — 2025-01-06 – p.29/33
U ( p.227) 20 4 “ U.R” ( anova() )
pp.226–227 2024 9-10 P — 2025-01-06 – p.30/33
2024 9-10 P — 2025-01-06 – p.31/33
5. (1) ( ) (2) 2025 1 16 ( )
23:59 JST ( ) Waseda Moodle (Q & A ) (1)(2) Discord 2024 9-10 P — 2025-01-06 – p.32/33
2024 9-10 P — 2025-01-06 – p.33/33