Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Zhou et al. 2019. Density Matching for Bilingua...
Search
tosho
July 04, 2019
Science
330
3
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Zhou et al. 2019. Density Matching for Bilingual Word Embedding. NAACL
tosho
July 04, 2019
More Decks by tosho
See All by tosho
LayerXにおけるセキュリティ管理の現在地と次の一手
tosho
0
180
Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation
tosho
0
320
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
tosho
0
390
Shaham and Levy, 2021. Neural Machine Translation without Embeddings. NAACL2021
tosho
0
130
Liu et al., 2021. Pay Attention to MLPs. arXiv
tosho
0
190
Huang et al. 2020 Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
tosho
0
500
Ive, Madhyastha, Specia_2019_EMNLP_Deep Copycat Networks for Text-to-Text Generation
tosho
0
170
Tan, Bansal_2019_EMNLP_LXMERT Learning Cross-Modality Encoder Representations from Transformers
tosho
0
270
Tsai et al._2019_ACL_Multimodal Transformer for Unaligned Multimodal Language Sequences
tosho
0
450
Other Decks in Science
See All in Science
機械学習 - ニューラルネットワーク入門
trycycle
PRO
0
1k
Physical AIを支えるWeights & Biases
olachinkei
1
370
Van Dare naar Durf
voginip
0
230
データベース05: SQL(2/3) 結合質問
trycycle
PRO
0
1.2k
Inside the Mind of an LLM
baggiponte
0
180
[NLP2026 参加報告会] AI for Science まとめ / NLP2026
lychee1223
0
1.9k
YouTubeにおける撤回論文の参照実態 / metascience-meetup2026
corgies
3
290
Kritische evaluatie van GenAI-output voor literatuuronderzoek
voginip
0
160
MATSUO Makiko
genomethica
0
150
CVPR2026_VGGTとその仲間たち
mickey_0226
0
820
Tensor Factorization Meets Deformed Information Geometry: Convex Relaxation under Deformed Algebra
gkazunii
0
110
機械学習 - K-means & 階層的クラスタリング
trycycle
PRO
0
1.7k
Featured
See All Featured
The Hidden Cost of Media on the Web [PixelPalooza 2025]
tammyeverts
2
330
Exploring anti-patterns in Rails
aemeredith
3
410
Designing Experiences People Love
moore
143
24k
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
180
The Art of Programming - Codeland 2020
erikaheidi
57
14k
Facilitating Awesome Meetings
lara
57
7k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
123
22k
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
2
850
WENDY [Excerpt]
tessaabrams
11
38k
Embracing the Ebb and Flow
colly
88
5.1k
brightonSEO & MeasureFest 2025 - Christian Goodrich - Winning strategies for Black Friday CRO & PPC
cargoodrich
3
730
The agentic SEO stack - context over prompts
schlessera
0
820
Transcript
Density Matching for Bilingual Word Embedding Chunting Zhou, Xuezhe Ma,
Di Wang, Graham Neubig Language Technologies Institute Carnegie Mellon University
@: • *)5:5> ' -9 ; •
02?=/:5> ' 34% • B<CAIdentical Words (72 ; ,+ • ; • 8&1# Refinement !# • Bilingual Lexicon Induction (BLI) "$.6%
Cross-lingual Word Embedding • B,-7?7A)+5")+ • /*>?7 -7/ 69 •
48(@1; • high-resource -7(@ .= low-resource -748 • ' • Online: !D 0<#?7A)+(@ • Offline: ?-7 (@ 3-7?7A)+215" :C%,(@&$
Offline Cross-lingual Word Embedding • +TO@>:O@KFIS.7&1 • KF)N5;D,3/RG721?J4M • Wasserstein
RG JS #$! %" 721 • CKF)N5;GN-Q • 8=< • KF)N5;<5; D,(/6A09 * 4MPE: D, 3 L • KF)N5; <HB'
DeMa-BWE • Density Matching for Bilingual Word Embedding • OH*V69ADW1/:*U69
• ;Q(,5POH*V69 • <HAD*U\L73J[5P • ADE- "%!% # • GI2KFRCM' B@E-J[ • 5P8. +0 • R`]back-translation$ • Y >N 5PIdentical words ?4 • _1&S=XT)*U^Z1
Contribution • MUSE $&! • )#% morphologically rich
#% * • • "' * +(
Normalizing flows • "- • $!< &1 • 7; !<
='+( )9 • 02# ,/ >35% *. >35% $!< 6:48 !< >35% $!<
Density Estimation in Monolingual Space • %$ • $
! % " • % # %$ x_i " $ & ! %
Density Matching • "$+< ;7#?,.2 ! •
684:(&5% #>03 • KL -*) +< • (&'/ Normalizing flows x #> y #> 1= #> 9= #> 03
Density Matching • %'.A @;&D/15#!#$ • #":<8? +)9(&C37 •
KL #0-, .A • +)*2 Normalizing flows x &C y &C # 4B &C >B &C y &C x >B= 6E #!# 37
Conditional Density Matching • Conditional Density Matching • •
• • •
Weak Orthogonality Constraint • /4.# + Orthogonality ) • *15"*1/4
!8$'7"(/4 ,613, %2 • 7:9 ,-0&
Weak Supervision with Identical Words •
Objectives for DeMa-BWE • Conditional Density Matching Weak Orthogonality
Constraint Weak Supervision with Identical Words
Cross-Domain Similarity Local Scaling • CSLS • % •
CSLS-D • '* #") ! $& '* k-NN ( '*
Iterative Procrustes Refinement • X Y
•
Experiment • MUSE • English ó Spanish; Japanese; Finnish;
... • Pretrained Word Embedding: FastText w/ Wikipedia • Normalizing, Centering • : 0.01 (en), 0.015 (morph-rich), 0.02 (others) • Vocabulary: 10,000 (en-ja), 20,000 (other pairs) • Loss: • back-translation loss: λ = 0.5 • supervised loss: α = 5 (en-zh), 10 (other pairs)
Precision@1 for MUSE BLI task
SL-unsup-ID
Morphologically complex languages
Pearson rank correlation •
Ablation study • Identical Words • en-ja identical
words • Density matching loss • Back-translation loss •
Conclusion • -,0!$// (1)' • &,(1# #*% • .
• Identical Words ++"