Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Autoencoding Variational Inference for Topic Mo...
Search
Kento Nozawa
June 15, 2017
Research
30k
3
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Autoencoding Variational Inference for Topic Modelsの解説スライド
ICLR2017読み会のスライド
https://connpass.com/event/57631/
Kento Nozawa
June 15, 2017
More Decks by Kento Nozawa
See All by Kento Nozawa
Analysis on Negative Sample Size in Contrastive Unsupervised Representation Learning
nzw0301
0
210
[IJCAI-ECAI 2022] Evaluation Methods for Representation Learning: A Survey
nzw0301
0
680
[NeurIPS Japan meetup 2021 talk] Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning
nzw0301
0
260
[IBIS2021] 対照的自己教師付き表現学習おける負例数の解析
nzw0301
0
220
Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning
nzw0301
0
550
Introduction of PAC-Bayes and its Application for Contrastive Unsupervised Representation Learning
nzw0301
2
900
NLP Tutorial; word representation learning
nzw0301
0
260
Analyzing Centralities of Embedded Nodes
nzw0301
0
220
Paper Reading: Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics
nzw0301
2
1.3k
Other Decks in Research
See All in Research
IEEE AIxVR 2026 Keynote Talk: "Beyond Visibility: Understanding Scenes and Humans under Challenging Conditions with Diverse Sensing"
miso2024
0
200
長時間動画QAにおけるマルチエージェント推論 ・SVAgent: Storyline-Guided Long Video Understanding via Cross-Modal Multi-Agent Collaboration
murakawatakuya
1
120
The mathematics of transformers
gpeyre
0
330
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent
satai
2
300
Sequences of Logits Reveal the Low Rank Structure of Language Models
sansantech
PRO
1
260
Cross-Media Information Spaces and Architectures
signer
PRO
0
300
LLM の Attention 機構まとめ — 数式・計算量・メモリ
puwaer
8
2.1k
Using our influence and power for patient safety
helenbevan
0
360
量子コンピュータの紹介
oqtopus
0
330
論文紹介 "ReSim: Reliable World Simulation for Autonomous Driving"
kogo
0
630
「なんとなく」の顧客理解から脱却する ──顧客の解像度を武器にするインサイトマネジメント
tajima_kaho
10
7.6k
通時的な類似度行列に基づく単語の意味変化の分析
rudorudo11
0
310
Featured
See All Featured
RailsConf 2023
tenderlove
30
1.5k
Build The Right Thing And Hit Your Dates
maggiecrowley
39
3.2k
Facilitating Awesome Meetings
lara
57
7k
Leading Effective Engineering Teams in the AI Era
addyosmani
9
2.1k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.9k
Principles of Awesome APIs and How to Build Them.
keavy
128
18k
Mind Mapping
helmedeiros
PRO
1
250
Conquering PDFs: document understanding beyond plain text
inesmontani
PRO
4
2.8k
Color Theory Basics | Prateek | Gurzu
gurzu
0
360
Crafting Experiences
bethany
1
180
Producing Creativity
orderedlist
PRO
348
40k
[SF Ruby Conf 2025] Rails X
palkan
2
1.1k
Transcript
Autoencoding Variational Inference For Topic Models Akash Srivastava and Charles
Sutton ICLR2017ಡΈձ ಡΉਓ: @nzw0301
֓ཁ 1. Latent Dirichlet Allocation (LDA) ΛNeural Variational Inference (NVI)
Ͱ • Dirichlet ͷ reparameterization trick 2. ৽ϞσϧͷఏҊ 3. ѱ͍ہॴղʹϋϚΔͷΛ༧ 2
ࣄલࣝɿLDAͱVAEͷ֓ཁ 3
LDA จॻͷ֬తੜϞσϧ [Blei et al., 2003]
จॻͷτϐοΫQ [cВ ݚڀ ՝ ࣝ Պֶऀ ʜ ػցֶश ਓೳ Ϟσϧ αϯϓϧ ʜ τϐοΫͷ୯ޠ p(w|β) Ќ Ќ ػցֶश ػցֶशݚڀ ਓೳ՝ Ϟσϧ-%" Պֶֶण࢘ ίʔύε 4
VAE: Encoder • NNΛͬͨੜϞσϧ • Encoder: • σʔλ͔Β֬ͷύϥϝʔλͷม • ֬જࡏมΛੜ
• Decoder: • જࡏม͔Βσʔλੜ • Reparameterization trick • BPʹαϯϓϧΛؚΊΔ • ඪ४ਖ਼نͷαϯϓϧͱͷ ύϥϝʔλ͔ΒαϯϓϧΛߏ 5
VAE: Decoder • NNΛͬͨੜϞσϧ • Encoder: • σʔλ͔Β֬ͷύϥϝʔλͷม • ֬જࡏมΛੜ
• Decoder: • જࡏม͔Βσʔλੜ • Reparameterization trick • BPʹαϯϓϧΛؚΊΔ • ඪ४ਖ਼نͷαϯϓϧͱͷ ύϥϝʔλ͔ΒαϯϓϧΛߏ 6
VAE: Reparameterization trick • NNΛͬͨੜϞσϧ • Encoder: • σʔλ͔Β֬ͷύϥϝʔλͷม •
֬જࡏมΛੜ • Decoder: • જࡏม͔Βσʔλੜ • Reparameterization trick • BPʹαϯϓϧΛؚΊΔ • ඪ४ਖ਼نͷαϯϓϧͱͷ ύϥϝʔλ͔ΒαϯϓϧΛߏ 7
VAE: ϩεؔ 8 L (⇥) = D X d=1 (
1 2 ⇣ tr (⌃0) + µT 0 µ0 K log | ⌃0 | ⌘ + E ✏⇠N (0,1) ⇣ log p xd |f ( µ0 + ⌃ 1/2 0 ✏ ) ⌘ ) (Ⅰ) ࣄલͱͷKLμΠόʔδΣϯε (Ⅱ) ର ࣜશମ: Evidence Lower Bound (I) (Ⅱ)
ຊ 9
Reparameterization trick for Dirichlet Distribution • LDAͷθ: Dirichlet͔Βαϯϓϧ • Scale
family DistributionͰͳ͍ͨΊɼߏͰ͖ͳ͍ 10 จॻͷτϐοΫQ [cВ
Reparameterization trick for Dirichlet Distribution • LDAͷθ: Dirichlet͔Βαϯϓϧ • Scale
family DistributionͰͳ͍ͨΊɼߏͰ͖ͳ͍ • Laplace approximation • ਖ਼نͷαϯϓϧʹsoftmaxؔΛద༻ͯ͠༻ • ࣄલͷύϥϝʔλɿ µk = log( ↵k) 1 K K X i=1 log ↵i ⌃k,k = 1 ↵k (1 2 K ) + 1 K2 K X i=1 1 ↵k 11
ωοτϫʔΫͱϩεؔ 12 X encoder µ( X ) ⌃ ( X
) KL {N( z ; µ( X ) , ⌃ ( X ))||N( z ; µ1, ⌃1)} ✏ ⇠ N(✏; 0, I ) + decoder: f ( Z ) loss ( x, f ( Z )) • σ: softmaxؔ • β : DecoderͷॏΈʢunnormalizedʣ • σ(β): ୯ޠͷDiriclet͔ΒͷαϯϓϧʹରԠ L ( ⇥ ) = D X d=1 ( 1 2 ⇣ tr ( ⌃ 1 1 ⌃0) + ( µ1 µ0) T ⌃ 1 1 ( µ1 µ0) K + log |⌃1 | |⌃0 | ⌘ + E ✏⇠N (0,1) wt d log ⇣ ( µ0 + ⌃1/2 0 ✏ ) ⌘ !) θ සϕΫτϧ
prodLDA: ఏҊϞσϧ • Products of Experts • βͱθͷੵʹsoftmaxؔ 13 L
( ⇥ ) = D X d=1 ( 1 2 ⇣ tr ( ⌃ 1 1 ⌃0) + ( µ1 µ0) T ⌃ 1 1 ( µ1 µ0) K + log |⌃1 | |⌃0 | ⌘ + E ✏⇠N (0,1) wt d log ⇣ ( µ0 + ⌃1/2 0 ✏ ) ⌘ !) ( ✓)
࠷దԽͱωοτϫʔΫͷ NVIͷɿ ֶशͷॳظஈ֊Ͱlocal optimumʹߦ͖͍͢ • AdamͷύϥϝʔλΛௐ • ηͱβ1 ͷͷߴΊʹઃఆ •
Batch NormalizationͱDropoutΛ༻ 14
࣮ݧ 1. CoherenceͱPerplexity • ޙड़ 2. ֶशͱࣄલΛม͑ͨͱ͖ͷޮՌ • ߴֶ͍श &
Dirichlet͕ϕλʔ 3. ςετσʔλʹର͢Δ࠷దԽͷ༗ແ • ͠ͳ͍͍ͯ͘ 4. p(w|β)ͷϦετ • লུ 15
Coherence 16 දจ͔ΒҾ༻ • LDA VAE: ఏҊਪ๏ • prodLDA: ఏҊਪ๏+ఏҊϞσϧ
• LDA DMFVI: Online Mean-Field Variational Inference • NVDM: VAEϕʔεͷจॻϞσϦϯά දͷ: 40ճ࣮ߦͯ͠ࢉग़
Perplexity 17 දจ͔ΒҾ༻
ϨϏϡʔ: ؾʹͳͬͨͷΛ͍͔ͭ͘ Q1. NVDMͰadamͷֶशΛม͑ͨํ͕ެฏ A1. จʹө Q2. ϋΠύʔύϥϝʔλ࠷దԽ͔ͨ͠ A2. ൺֱख๏͍ͯ͠ΔɼఏҊख๏BO
Rating: 6-7-6-5 18
ͦͷଞ • ஶऀ࣮: TensorFlow • NVDMͷஶऀΒͷ৽Ϟσϧ͕ICML2017ʹ࠾ 19