Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
統計的学習理論の基礎 II
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Masanari Kimura
March 05, 2021
Research
400
3
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
統計的学習理論の基礎 II
Masanari Kimura
March 05, 2021
More Decks by Masanari Kimura
See All by Masanari Kimura
Equivalence of Geodesics and Importance Weighting from the Perspective of Information Geometry
mkimura
0
370
機械学習における重要度重み付けとその応用
mkimura
3
3.4k
Paper Intro: Human Rademacher Complexity
mkimura
0
230
On the principle of Invariant Risk Minimization
mkimura
0
390
論文紹介:Clustering with Bregman Divergences: an Asymptotic Analysis
mkimura
0
620
Generalization Bounds for Set-to-Set Matching with Negative Sampling
mkimura
0
190
論文紹介:On the Importance of Gradients for Detecting Distributional Shifts in the Wild
mkimura
2
890
論文紹介:Dangers of Bayesian Model Averaging under Covariate Shift
mkimura
0
380
Information Geometry of Dropout Training
mkimura
0
350
Other Decks in Research
See All in Research
非試合日の野球場を楽しむためのARホームランボールキャッチ体験システムの開発 / EC79-miyazaki
yumulab
0
230
計算情報学研究室(数理情報学第7研究室)2026
tomohirokoana
0
560
事後確率分布の共分散について
koide3
0
120
ScoreMatchingRiesz for Automatic Debiased Machine Learning and Policy Path Estimation with an Application to Japanese Monetary Policy Evaluation
masakat0
0
290
明日から使える!研究効率化ツール入門
matsui_528
13
7.3k
CVPR2026論文紹介_VLMにとって良いvision encoderとは何か?Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance
kobayashi31
1
110
2026-01-30-MandSL-textbook-jp-cos-lod
yegusa
1
1.3k
第66回コンピュータビジョン勉強会@関東 Epona: Autoregressive Diffusion World Model for Autonomous Driving
kentosasaki
0
630
Apache Gravitinoで実現する Icebergカタログ統合とアクセスの一元化
matsumooon
0
280
Scalable dynamic origin-destination demand estimation enhanced by high-resolution satellite imagery data
satai
3
280
National high-resolution cropland classification of Japan with agricultural census information and multi-temporal multi-modality datasets
satai
3
290
敵対生成プロンプト同時探索による内省型プロンプト最適化
kinoue_smarthr
0
200
Featured
See All Featured
Why Our Code Smells
bkeepers
PRO
340
58k
Git: the NoSQL Database
bkeepers
PRO
432
67k
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
71
40k
Statistics for Hackers
jakevdp
799
230k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
12
1.2k
Docker and Python
trallard
47
3.9k
The #1 spot is gone: here's how to win anyway
tamaranovitovic
2
1.1k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Agile Actions for Facilitating Distributed Teams - ADO2019
mkilby
0
210
Dominate Local Search Results - an insider guide to GBP, reviews, and Local SEO
greggifford
PRO
0
190
Principles of Awesome APIs and How to Build Them.
keavy
128
18k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
8.2k
Transcript
CompML ౷ܭతֶशཧͷجૅ II Masanari Kimura (@machinery81)
CompML TL;DR • ౷ܭతֶशཧͷجૅతͳࣄ߲ͷ·ͱΊ • ୈೋճҎԼͷτϐοΫʹ͍ͭͯ • ू߹ͷ֓೦ • VC-Dimension
• Pseudo-Dimension • Fat-Shattering Dimension • VCόϯυ 2
CompML VC-Dimension
CompML VC-Dimension ఆٛ 1.ʢVC-࣍ݩʣՄଌۭؒ ͷ͋Δू߹Λ ͱ͢Δɽશͯͷ෦ू߹ ʹ͍ͭͯɼ ͱͳΔΑ͏ͳ ͕ଘࡏ͢Δͱ͖ɼू߹
Ͱ͞ ΕΔͱ͍͏ɽ ͷVapnik-Chervonenkis࣍ݩ ɼ ʹΑͬͯ͞ΕΔू ߹ͷجͷ࠷େʹ͍͠ɽ (𝑋, 𝑆) 𝒜 ⊂ 𝑆 𝐵 ⊂ 𝑆 𝑆 ∩ 𝐴 = 𝐵 𝐴 ∈ 𝒜 𝑆 𝒜 𝒜 𝑉𝐶𝑑𝑖𝑚(𝒜) 𝒜 Photo by Wikipedia.
CompML The Pseudo-Dimension ఆٛ2.ʢ -࣍ݩʣՄଌۭؒ ͷ্ͷՄଌؔͷू߹Λ ͱ͢Δɽ ू߹ ҎԼ͕Γཱͭͱ͖ -shatteredͰ͋Δͱ͍͏ɿ
ҙͷ2ϕΫτϧ ͱͦΕʹରԠ͢Δؔ ʹ͍ͭͯɼ ্هͷ݅ΛHeavisideؔ Ͱॻ͖͑Δͱ ؔΫϥε ͷ -࣍ݩ ʹΑͬͯ -shatteredͱͳΔΑ͏ͳू߹ͷجͷ࠷େͰఆٛ͞Εɼ ͱॻ͔ΕΔɽ 𝑃 (𝑋, 𝑆 ) ℱ ⊂ [0,𝑅] 𝑋 𝑆 = {𝑥1 , …, 𝑥𝑛} ⊂ 𝑋 𝑃 𝑒 ∈ {0,1}𝑛 𝑓𝑒 ∈ ℱ { 𝑓𝑒(𝑥𝑖) ≥ 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 1, 𝑓𝑒(𝑥𝑖) < 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 0. 𝜂(𝑧) 𝜂[𝑓𝑒(𝑥𝑖) − 𝑐𝑖] = 𝑒𝑖 , ∀𝑖, ∀𝑒 . ℱ 𝑃 ℱ 𝑃 𝑃𝑑𝑖𝑚(ℱ)
CompML Illustration of P-Shattering 𝑥1 𝑥2 𝑥3 𝑓 [01…1] 𝑓
[00…1] 𝑓 [11…0] 𝑐1 𝑐2 𝑐3 { 𝑓𝑒(𝑥𝑖) ≥ 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 1, 𝑓𝑒(𝑥𝑖) < 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 0.
CompML VC࣍ݩͱ -࣍ݩͷಉ݅ 𝑃 ิ1ɽ ʹ͍ͭͯɼҎԼͷΑ͏ʹ Λఆٛ͢Δɿ ͜ͷͱ͖ɼ ℱ =
{𝑓:𝑋 → [0,𝑅]} ¯ ℱ ¯ ℱ = { ¯ 𝑓(𝑥, 𝑐) = 𝜂[𝑓(𝑥) − 𝑐] :𝑓 ∈ ℱ} . 𝑃𝑑𝑖𝑚( ¯ ℱ) = 𝑉𝐶𝑑𝑖𝑚( ¯ ℱ) .
CompML The Fat-Shattering Dimension ఆٛɽʢFat-Shattering࣍ݩʣ Մଌۭؒ ͷ্ͷՄଌؔͷू߹Λ ͱ͢Δɽू߹ Ҏ Լ͕Γཱͭͱ͖෯
͓Αͼਫ਼ Ͱfat-shatteredͰ͋Δͱ͍͏ɿ ҙͷ2ϕΫτϧ ͱͦΕʹରԠ͢Δؔ ʹ͍ͭͯɼ ؔΫϥε ͷFat-Shattering࣍ݩ ʹΑͬͯfat-shatteredͱͳΔΑ͏ͳू߹ͷج ͷ࠷େͰఆٛ͞Εɼ ͱॻ͔ΕΔɽ (𝑋, 𝑆) ℱ ⊂ [0,𝑅] 𝑋 S = {x1 , …, xn } γ c 𝑒 ∈ {0,1}𝑛 𝑓𝑒 ∈ ℱ { fe (xi ) ≥ ci + γ if ei = 1, fe (xi ) < ci − γ if ei = 0. ℱ ℱ Fdim(ℱ, γ)
CompML VC Generalization Bound ఆཧɽظޡࠩ ͓Αͼܦݧޡࠩ ʹ͍ͭͯɼVC࣍ݩΛ ͱॻ͘ͱɼ ͕ຬ͞ΕΔɽ ൚Խޡ͕ࠩVC࣍ݩΛ༻͍ͯ͑ΒΕΔɽ
R(h) ̂ R(h) dVC R(h) − ̂ R(h) ≤ 8dVC(ln 2m dVC + 1) + 8 ln 4 δ m
CompML LemmaʢSymmetrizationʣ ิɽ ͱͳΔΑ͏ͳ ʹ͍ͭͯɼ ͕Γཱͭɽ͜͜Ͱ ؔͷظͱܦݧͷࠩɼಠཱʹಘΒΕͨೋछྨͷܦݧͷࠩͰ͑ΒΕΔɽ t ≥ 2/m
t > 0 P( sup f∈ℱ | f − ̂ f | ) ≤ 2P( sup f∈ℱ | ̂ f′ − ̂ f | ≥ t/2) f = 𝔼[ f ] ̂ f = 1 m m ∑ i=1 f(xi , yi ) ̂ f′ = 1 m m ∑ i=1 f(x′ i , y′ i )
CompML ࢀߟจݙ • Shalev-Shwartz, S., Ben-David, S. (2014). Understanding Machine
Learning - From Theory to Algorithms.. Cambridge University Press. ISBN: 978-1-10-705713-5 • Mohri, Mehryar, Afshin Rostamizadeh, and Ameet Talwalkar. Foundations of machine learning. MIT press, 2018.