Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Autoencoding Variational Inference for Topic Mo...
Search
Kento Nozawa
June 15, 2017
Research
30k
3
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Autoencoding Variational Inference for Topic Modelsの解説スライド
ICLR2017読み会のスライド
https://connpass.com/event/57631/
Kento Nozawa
June 15, 2017
More Decks by Kento Nozawa
See All by Kento Nozawa
Analysis on Negative Sample Size in Contrastive Unsupervised Representation Learning
nzw0301
0
210
[IJCAI-ECAI 2022] Evaluation Methods for Representation Learning: A Survey
nzw0301
0
680
[NeurIPS Japan meetup 2021 talk] Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning
nzw0301
0
260
[IBIS2021] 対照的自己教師付き表現学習おける負例数の解析
nzw0301
0
220
Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning
nzw0301
0
550
Introduction of PAC-Bayes and its Application for Contrastive Unsupervised Representation Learning
nzw0301
2
900
NLP Tutorial; word representation learning
nzw0301
0
260
Analyzing Centralities of Embedded Nodes
nzw0301
0
220
Paper Reading: Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics
nzw0301
2
1.3k
Other Decks in Research
See All in Research
2026-01-30-MandSL-textbook-jp-cos-lod
yegusa
1
1.3k
Model Discovery and Graph Simulation: A Lightweight Gateway to Chaos Engineering
anatolykr
0
200
LINEヤフー データサイエンス Meetup「三井物産コモディティ予測チャレンジ」の舞台裏-AlpacaTechパート
gamella
1
570
SAKURAONE:An Open Ethernet-based AI HPC System And Its Observed Workload Dynamicsin a Single-Tenant LLM Development Environment
yuukit
1
330
The mathematics of transformers
gpeyre
0
330
世界モデルにおける分布外データ対応の方法論
koukyo1994
7
2.2k
ブレグマン距離最小化に基づくリース表現量推定:バイアス除去学習の統一理論
masakat0
0
280
多様なデータを許容し学習し続ける模倣学習 / Advanced Imitation Learning for VLA
prinlab
0
220
長時間動画QAにおけるマルチエージェント推論 ・SVAgent: Storyline-Guided Long Video Understanding via Cross-Modal Multi-Agent Collaboration
murakawatakuya
1
120
AIで最適化を解けるか?
mickey_kubo
0
120
重要だけど測れていないもの:高齢者ケアの見えない課題
theoriatec2024
0
350
コーディングエージェントとABNを再考
hf149
2
710
Featured
See All Featured
What Being in a Rock Band Can Teach Us About Real World SEO
427marketing
0
250
How to Ace a Technical Interview
jacobian
281
24k
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
590
Agile that works and the tools we love
rasmusluckow
331
21k
Leading Effective Engineering Teams in the AI Era
addyosmani
9
2.1k
Principles of Awesome APIs and How to Build Them.
keavy
128
18k
Prompt Engineering for Job Search
mfonobong
0
340
Making the Leap to Tech Lead
cromwellryan
135
9.9k
Learning to Love Humans: Emotional Interface Design
aarron
275
41k
4 Signs Your Business is Dying
shpigford
187
22k
Leo the Paperboy
mayatellez
7
1.8k
Building Experiences: Design Systems, User Experience, and Full Site Editing
marktimemedia
0
530
Transcript
Autoencoding Variational Inference For Topic Models Akash Srivastava and Charles
Sutton ICLR2017ಡΈձ ಡΉਓ: @nzw0301
֓ཁ 1. Latent Dirichlet Allocation (LDA) ΛNeural Variational Inference (NVI)
Ͱ • Dirichlet ͷ reparameterization trick 2. ৽ϞσϧͷఏҊ 3. ѱ͍ہॴղʹϋϚΔͷΛ༧ 2
ࣄલࣝɿLDAͱVAEͷ֓ཁ 3
LDA จॻͷ֬తੜϞσϧ [Blei et al., 2003]
จॻͷτϐοΫQ [cВ ݚڀ ՝ ࣝ Պֶऀ ʜ ػցֶश ਓೳ Ϟσϧ αϯϓϧ ʜ τϐοΫͷ୯ޠ p(w|β) Ќ Ќ ػցֶश ػցֶशݚڀ ਓೳ՝ Ϟσϧ-%" Պֶֶण࢘ ίʔύε 4
VAE: Encoder • NNΛͬͨੜϞσϧ • Encoder: • σʔλ͔Β֬ͷύϥϝʔλͷม • ֬જࡏมΛੜ
• Decoder: • જࡏม͔Βσʔλੜ • Reparameterization trick • BPʹαϯϓϧΛؚΊΔ • ඪ४ਖ਼نͷαϯϓϧͱͷ ύϥϝʔλ͔ΒαϯϓϧΛߏ 5
VAE: Decoder • NNΛͬͨੜϞσϧ • Encoder: • σʔλ͔Β֬ͷύϥϝʔλͷม • ֬જࡏมΛੜ
• Decoder: • જࡏม͔Βσʔλੜ • Reparameterization trick • BPʹαϯϓϧΛؚΊΔ • ඪ४ਖ਼نͷαϯϓϧͱͷ ύϥϝʔλ͔ΒαϯϓϧΛߏ 6
VAE: Reparameterization trick • NNΛͬͨੜϞσϧ • Encoder: • σʔλ͔Β֬ͷύϥϝʔλͷม •
֬જࡏมΛੜ • Decoder: • જࡏม͔Βσʔλੜ • Reparameterization trick • BPʹαϯϓϧΛؚΊΔ • ඪ४ਖ਼نͷαϯϓϧͱͷ ύϥϝʔλ͔ΒαϯϓϧΛߏ 7
VAE: ϩεؔ 8 L (⇥) = D X d=1 (
1 2 ⇣ tr (⌃0) + µT 0 µ0 K log | ⌃0 | ⌘ + E ✏⇠N (0,1) ⇣ log p xd |f ( µ0 + ⌃ 1/2 0 ✏ ) ⌘ ) (Ⅰ) ࣄલͱͷKLμΠόʔδΣϯε (Ⅱ) ର ࣜશମ: Evidence Lower Bound (I) (Ⅱ)
ຊ 9
Reparameterization trick for Dirichlet Distribution • LDAͷθ: Dirichlet͔Βαϯϓϧ • Scale
family DistributionͰͳ͍ͨΊɼߏͰ͖ͳ͍ 10 จॻͷτϐοΫQ [cВ
Reparameterization trick for Dirichlet Distribution • LDAͷθ: Dirichlet͔Βαϯϓϧ • Scale
family DistributionͰͳ͍ͨΊɼߏͰ͖ͳ͍ • Laplace approximation • ਖ਼نͷαϯϓϧʹsoftmaxؔΛద༻ͯ͠༻ • ࣄલͷύϥϝʔλɿ µk = log( ↵k) 1 K K X i=1 log ↵i ⌃k,k = 1 ↵k (1 2 K ) + 1 K2 K X i=1 1 ↵k 11
ωοτϫʔΫͱϩεؔ 12 X encoder µ( X ) ⌃ ( X
) KL {N( z ; µ( X ) , ⌃ ( X ))||N( z ; µ1, ⌃1)} ✏ ⇠ N(✏; 0, I ) + decoder: f ( Z ) loss ( x, f ( Z )) • σ: softmaxؔ • β : DecoderͷॏΈʢunnormalizedʣ • σ(β): ୯ޠͷDiriclet͔ΒͷαϯϓϧʹରԠ L ( ⇥ ) = D X d=1 ( 1 2 ⇣ tr ( ⌃ 1 1 ⌃0) + ( µ1 µ0) T ⌃ 1 1 ( µ1 µ0) K + log |⌃1 | |⌃0 | ⌘ + E ✏⇠N (0,1) wt d log ⇣ ( µ0 + ⌃1/2 0 ✏ ) ⌘ !) θ සϕΫτϧ
prodLDA: ఏҊϞσϧ • Products of Experts • βͱθͷੵʹsoftmaxؔ 13 L
( ⇥ ) = D X d=1 ( 1 2 ⇣ tr ( ⌃ 1 1 ⌃0) + ( µ1 µ0) T ⌃ 1 1 ( µ1 µ0) K + log |⌃1 | |⌃0 | ⌘ + E ✏⇠N (0,1) wt d log ⇣ ( µ0 + ⌃1/2 0 ✏ ) ⌘ !) ( ✓)
࠷దԽͱωοτϫʔΫͷ NVIͷɿ ֶशͷॳظஈ֊Ͱlocal optimumʹߦ͖͍͢ • AdamͷύϥϝʔλΛௐ • ηͱβ1 ͷͷߴΊʹઃఆ •
Batch NormalizationͱDropoutΛ༻ 14
࣮ݧ 1. CoherenceͱPerplexity • ޙड़ 2. ֶशͱࣄલΛม͑ͨͱ͖ͷޮՌ • ߴֶ͍श &
Dirichlet͕ϕλʔ 3. ςετσʔλʹର͢Δ࠷దԽͷ༗ແ • ͠ͳ͍͍ͯ͘ 4. p(w|β)ͷϦετ • লུ 15
Coherence 16 දจ͔ΒҾ༻ • LDA VAE: ఏҊਪ๏ • prodLDA: ఏҊਪ๏+ఏҊϞσϧ
• LDA DMFVI: Online Mean-Field Variational Inference • NVDM: VAEϕʔεͷจॻϞσϦϯά දͷ: 40ճ࣮ߦͯ͠ࢉग़
Perplexity 17 දจ͔ΒҾ༻
ϨϏϡʔ: ؾʹͳͬͨͷΛ͍͔ͭ͘ Q1. NVDMͰadamͷֶशΛม͑ͨํ͕ެฏ A1. จʹө Q2. ϋΠύʔύϥϝʔλ࠷దԽ͔ͨ͠ A2. ൺֱख๏͍ͯ͠ΔɼఏҊख๏BO
Rating: 6-7-6-5 18
ͦͷଞ • ஶऀ࣮: TensorFlow • NVDMͷஶऀΒͷ৽Ϟσϧ͕ICML2017ʹ࠾ 19