et al., 2016; Calixto et al., 2016; Libovický and Helcl, 2017; Helcl et al., 2018]
• Cross-modal interactions with spatially-unaware global features [Calixto and Liu, 2017; Ma et al., 2017; Caglayan et al., 2017a; Madhyastha et al., 2017]
• The integration of regional features from object detection networks [Huang et al., 2016; Grönroos et al., 2018]
5/16/19
• Two young girls are sitting on the street eating food.
• Zwei junge mädchen sitzen auf der straße und essen mais.
et al., 2007]
• data splits:
• English to French
• 9,951 English and 11,216 French words BPE
• Degradation train/dev/test
Dataset: Color Deprivation, Progressive Masking, Entity Masking
train (multi30k): train, train
val (multi30k): train, dev
test2016: dev, test
test2017: test, -
• Color Deprivation
• +1.6 METEOR (HIER vs NMT)
• +12% color accuracy (HIER vs NMT)
• +4% color accuracy (DIRECT vs NMT)
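
The color-deprivation setting behind these numbers can be sketched in a few lines. This is a minimal illustration, assuming color deprivation means replacing color words in the source sentence with a placeholder token, so that the color can only be recovered from the image. The color list `COLORS`, the mask token `MASK`, and the function name `deprive_colors` are hypothetical and do not come from the slides.

```python
# Hypothetical color vocabulary and mask token (assumptions, not from the slides).
COLORS = {
    "red", "green", "blue", "yellow", "black", "white",
    "brown", "orange", "pink", "purple", "gray", "grey",
}
MASK = "[v]"


def deprive_colors(sentence: str) -> str:
    """Replace each color word in a whitespace-tokenized sentence with MASK.

    Matching is case-insensitive and works on whole tokens only, so a color
    word with attached punctuation (e.g. "red.") is left unchanged.
    """
    tokens = sentence.split()
    masked = [MASK if tok.lower() in COLORS else tok for tok in tokens]
    return " ".join(masked)


if __name__ == "__main__":
    print(deprive_colors("a girl in a red dress holds a blue balloon ."))
```

Only the source side is masked in this sketch; the reference translation keeps its color words, which is what would make a color-accuracy comparison between systems meaningful.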