Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Zhou et al. 2019. Density Matching for Bilingua...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
tosho
July 04, 2019
Science
330
3
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Zhou et al. 2019. Density Matching for Bilingual Word Embedding. NAACL
tosho
July 04, 2019
More Decks by tosho
See All by tosho
LayerXにおけるセキュリティ管理の現在地と次の一手
tosho
0
180
Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation
tosho
0
320
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
tosho
0
390
Shaham and Levy, 2021. Neural Machine Translation without Embeddings. NAACL2021
tosho
0
130
Liu et al., 2021. Pay Attention to MLPs. arXiv
tosho
0
190
Huang et al. 2020 Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
tosho
0
500
Ive, Madhyastha, Specia_2019_EMNLP_Deep Copycat Networks for Text-to-Text Generation
tosho
0
170
Tan, Bansal_2019_EMNLP_LXMERT Learning Cross-Modality Encoder Representations from Transformers
tosho
0
270
Tsai et al._2019_ACL_Multimodal Transformer for Unaligned Multimodal Language Sequences
tosho
0
450
Other Decks in Science
See All in Science
HajimetenoLT vol.17
hashimoto_kei
1
240
Kritische evaluatie van GenAI-output voor literatuuronderzoek
voginip
0
160
AI(人工知能)の過去・現在・未来 ~AIは人類を越えるのか~
tagtag
PRO
0
100
機械学習 - SVM
trycycle
PRO
2
1.1k
20251212_LT忘年会_データサイエンス枠_新川.pdf
shinpsan
0
290
フィードフォワードニューラルネットワークを用いた記号入出力制御系に対する制御器設計 / Controller Design for Augmented Systems with Symbolic Inputs and Outputs Using Feedforward Neural Network
konakalab
0
140
Testing the Longevity Bottleneck Hypothesis
chinson03
0
320
NDCG is NOT All I Need
statditto
2
3.2k
先端因果推論特別研究チームの研究構想と 人間とAIが協働する自律因果探索の展望
sshimizu2006
3
940
KISHIMOTO Atsuo
genomethica
0
150
データベース06: SQL (3/3) 副問い合わせ
trycycle
PRO
1
980
Inside the Mind of an LLM
baggiponte
0
180
Featured
See All Featured
Code Review Best Practice
trishagee
74
20k
Leveraging Curiosity to Care for An Aging Population
cassininazir
1
270
KATA
mclloyd
PRO
35
15k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
360
30k
The Art of Programming - Codeland 2020
erikaheidi
57
14k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
508
140k
XXLCSS - How to scale CSS and keep your sanity
sugarenia
250
1.3M
Building the Perfect Custom Keyboard
takai
2
790
Reality Check: Gamification 10 Years Later
codingconduct
0
2.2k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
4.3k
Bridging the Design Gap: How Collaborative Modelling removes blockers to flow between stakeholders and teams @FastFlow conf
baasie
0
580
From π to Pie charts
rasagy
0
210
Transcript
Density Matching for Bilingual Word Embedding Chunting Zhou, Xuezhe Ma,
Di Wang, Graham Neubig Language Technologies Institute Carnegie Mellon University
@: • *)5:5> ' -9 ; •
02?=/:5> ' 34% • B<CAIdentical Words (72 ; ,+ • ; • 8&1# Refinement !# • Bilingual Lexicon Induction (BLI) "$.6%
Cross-lingual Word Embedding • B,-7?7A)+5")+ • /*>?7 -7/ 69 •
48(@1; • high-resource -7(@ .= low-resource -748 • ' • Online: !D 0<#?7A)+(@ • Offline: ?-7 (@ 3-7?7A)+215" :C%,(@&$
Offline Cross-lingual Word Embedding • +TO@>:O@KFIS.7&1 • KF)N5;D,3/RG721?J4M • Wasserstein
RG JS #$! %" 721 • CKF)N5;GN-Q • 8=< • KF)N5;<5; D,(/6A09 * 4MPE: D, 3 L • KF)N5; <HB'
DeMa-BWE • Density Matching for Bilingual Word Embedding • OH*V69ADW1/:*U69
• ;Q(,5POH*V69 • <HAD*U\L73J[5P • ADE- "%!% # • GI2KFRCM' B@E-J[ • 5P8. +0 • R`]back-translation$ • Y >N 5PIdentical words ?4 • _1&S=XT)*U^Z1
Contribution • MUSE $&! • )#% morphologically rich
#% * • • "' * +(
Normalizing flows • "- • $!< &1 • 7; !<
='+( )9 • 02# ,/ >35% *. >35% $!< 6:48 !< >35% $!<
Density Estimation in Monolingual Space • %$ • $
! % " • % # %$ x_i " $ & ! %
Density Matching • "$+< ;7#?,.2 ! •
684:(&5% #>03 • KL -*) +< • (&'/ Normalizing flows x #> y #> 1= #> 9= #> 03
Density Matching • %'.A @;&D/15#!#$ • #":<8? +)9(&C37 •
KL #0-, .A • +)*2 Normalizing flows x &C y &C # 4B &C >B &C y &C x >B= 6E #!# 37
Conditional Density Matching • Conditional Density Matching • •
• • •
Weak Orthogonality Constraint • /4.# + Orthogonality ) • *15"*1/4
!8$'7"(/4 ,613, %2 • 7:9 ,-0&
Weak Supervision with Identical Words •
Objectives for DeMa-BWE • Conditional Density Matching Weak Orthogonality
Constraint Weak Supervision with Identical Words
Cross-Domain Similarity Local Scaling • CSLS • % •
CSLS-D • '* #") ! $& '* k-NN ( '*
Iterative Procrustes Refinement • X Y
•
Experiment • MUSE • English ó Spanish; Japanese; Finnish;
... • Pretrained Word Embedding: FastText w/ Wikipedia • Normalizing, Centering • : 0.01 (en), 0.015 (morph-rich), 0.02 (others) • Vocabulary: 10,000 (en-ja), 20,000 (other pairs) • Loss: • back-translation loss: λ = 0.5 • supervised loss: α = 5 (en-zh), 10 (other pairs)
Precision@1 for MUSE BLI task
SL-unsup-ID
Morphologically complex languages
Pearson rank correlation •
Ablation study • Identical Words • en-ja identical
words • Density matching loss • Back-translation loss •
Conclusion • -,0!$// (1)' • &,(1# #*% • .
• Identical Words ++"