Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Budzianowski et al. - EMNLP 2018 - MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Search
tosho
December 10, 2018
Research
0
270
Budzianowski et al. - EMNLP 2018 - MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
tosho
December 10, 2018
Tweet
Share
More Decks by tosho
See All by tosho
Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation
tosho
0
220
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
tosho
0
290
Shaham and Levy, 2021. Neural Machine Translation without Embeddings. NAACL2021
tosho
0
69
Liu et al., 2021. Pay Attention to MLPs. arXiv
tosho
0
110
Huang et al. 2020 Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
tosho
0
330
Ive, Madhyastha, Specia_2019_EMNLP_Deep Copycat Networks for Text-to-Text Generation
tosho
0
74
Tan, Bansal_2019_EMNLP_LXMERT Learning Cross-Modality Encoder Representations from Transformers
tosho
0
150
Tsai et al._2019_ACL_Multimodal Transformer for Unaligned Multimodal Language Sequences
tosho
0
240
Zhou et al. 2019. Density Matching for Bilingual Word Embedding. NAACL
tosho
3
180
Other Decks in Research
See All in Research
Deep State Space Models 101 / Mamba
kurita
9
3.8k
時系列解析と疫学
kingqwert
2
950
20240127_熊本から今いちど真面目に都市交通~めざせ「車1割削減、渋滞半減、公共交通2倍」~ 全国路面電車サミット2024宇都宮
trafficbrain
1
690
LLMマルチエージェントを俯瞰する
masatoto
26
17k
デフスポーツにおける支援技術 〜競技特性・ルールと技術との関係〜
slab
0
260
Equivalence of Geodesics and Importance Weighting from the Perspective of Information Geometry
mkimura
0
140
ゼロからわかるリザバーコンピューティング
kurotaky
1
350
Embodied AIについて / About Embodied AI
nttcom
1
690
Alternative Photographic Processes Reimagined: The Role of Digital Technology in Revitalizing Classic Printing Techniques【SIGGRAPH Asia 2023】
toremolo72
0
460
言語間転移学習で大規模言語モデルを賢くする
ikuyamada
8
3.8k
[KDD2023論文読み会] BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction / KDD2023 LY Tech Reading
shunk031
0
490
MegaParticles: GPUを利用したStein Particle Filterによる点群6自由度姿勢推定
koide3
1
590
Featured
See All Featured
Unsuck your backbone
ammeep
664
57k
VelocityConf: Rendering Performance Case Studies
addyosmani
321
23k
The World Runs on Bad Software
bkeepers
PRO
61
6.7k
A designer walks into a library…
pauljervisheath
201
23k
Designing on Purpose - Digital PM Summit 2013
jponch
111
6.5k
Large-scale JavaScript Application Architecture
addyosmani
504
110k
Stop Working from a Prison Cell
hatefulcrawdad
266
19k
The Illustrated Children's Guide to Kubernetes
chrisshort
32
47k
A Modern Web Designer's Workflow
chriscoyier
689
190k
How to train your dragon (web standard)
notwaldorf
75
5.2k
What's new in Ruby 2.0
geeforr
338
31k
Let's Do A Bunch of Simple Stuff to Make Websites Faster
chriscoyier
501
140k
Transcript
MultiWOZ – A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue
Modeling Tosho Hirasawa
0. Overview • -6<+?E$> • 4L3I/%) H2 • :@Multi-Domain
Wizard-of-Oz (MultiWOZ) • KJ • ("*72 GA/9F8 #!= 5 ,1 • 0.BD&* ' &(*;C
1. Introduction • Conversational Artificial Intelligence • human-level *)&($ •
#%' ! • Seneff and Polifroni, 2000 • "Raux et al., 2005 • Amazon AlexaRam et al., 2018
1. Introduction • \T@F [C0*%0# RA •
2DKU • =W:J • ?6) 8V • OXN3A • PH517 E2E ,"/LI • <];Z17MYB( >E • &!-0Q • " 9 • [C$+0_4D • GS5'.-0^
1. Introduction , , 2017
2. Related Works • >K&.(%3/9 ! • Machine-to-Machine • *5/4+"O6K"R
• HLJ-$) T DM6K\E ]X • Human-to-Machine • 7:=@^Y'(*0UZ9";I • G OE! :B • HLJ^Y'(*0 YS?,1$5&.(NI • Human-to-Human • G<QW &(+< • Twitter, Reddit, Ubuntu 6K"_8NI! • HLJ6KC[ AP#-*'25 FV
3. Data Collection Set-up • Wizard-of-Oz E4 • Dialogue Task:
• *,-@ ontology random sampling !'#%"8(6 • User Side: • (6=1 97CF.;A • System (Wizard) Side: • $ 2: 97/D • Wizard/User (6>, (6JG+ • (6)I30< • (6H5&?B)I30
3. Data Collection Set-up • Annotation of Dialogue Acts •
Dialogue Act = intent + slot-value pairs • intent: inform / request • slot-value: domain, price, … • Amazon Mechanical Turk +!" &$ dialogue acts .) • !" &$ '- /( • % ,*0.8843#0
4. MultiWOZ Dialogue Corpus •
: domain
4. MultiWOZ Dialogue Corpus : expensive : domain
4. MultiWOZ Dialogue Corpus • (turns in a
dialogue) • 8.93 (single-domain), 15.39 (multi-domain) • 115,434 turns • >70% 10 turns • (sentence length) • 11.75 (user), 15.12 (wizard)
4. MultiWOZ Dialogue Corpus • Dialogue Acts • 60% turns
action • %# • "$ • %# !"$
4. MultiWOZ Dialogue Corpus • •
• Multi-Domain, Dialogue Act
5. MultiWOZ as a New Benchmark • Dialogue modelling task
• Dialogue State Tracking • (,# '/ • &,.5-0)1 ontology • Dialogue-Context-to-Text Generation • (,Dialogue State, # '/ • &,!16 • Cam676/MultiWOZ 28 • % $"+* • RNN 473 • Cam676: GRU • MultiWOZ: LSTM
5. MultiWOZ as a New Benchmark • Dialogue-Act-to-Text Generation •
Structured meaning representation (Dialogue Act?) • • Semantically Conditioned LSTM (Wen+, 2015) • SFX MultiWOZ restaurant • SER = (missing slots + redundant slots) / total slots Wen+, 2015
6. Conclusion • )1"&7* 8 E2E #$20
• Modular-based (+%' • MultiWOZ 3 46 • !-53. github /,