Transferable Dialogue Systems and User Simulators

About

One of the difficulties in training dialogue systems is the lack of training data. We explore the possibility of creating dialogue data through the interaction between a dialogue system and a user simulator. Our goal is to develop a modelling framework that can incorporate new dialogue scenarios through self-play between the two agents. In this framework, we first pre-train the two agents on a collection of source domain dialogues, which equips the agents to converse with each other via natural language. With further fine-tuning on a small amount of target domain data, the agents continue to interact with the aim of improving their behaviors using reinforcement learning with structured reward functions. In experiments on the MultiWOZ dataset, two practical transfer learning problems are investigated: 1) domain adaptation and 2) single-to-multiple domain transfer. We demonstrate that the proposed framework is highly effective in bootstrapping the performance of the two agents in transfer learning. We also show that our method leads to improvements in dialogue system performance on complete datasets.

Bo-Hsiang Tseng, Yinpei Dai, Florian Kreyssig, Bill Byrne• 2021

Related benchmarks

Task	Dataset	Result
End-to-end task-oriented dialogue	MultiWOZ (test)	Task Success Rate94	68
End-to-end task-oriented dialogue	MultiWOZ 2.0 (test)	Inform Accuracy94.7	22
Task-oriented Dialogue	MultiWOZ Restaurant domain 1.0	Combined Score97.6	10
Task-oriented Dialogue	MultiWOZ Hotel domain 1.0	Combined Score100.7	10
Task-oriented Dialogue	MultiWOZ Attraction domain 1.0	Combined Score96	10
Task-oriented Dialogue	MultiWOZ 1.0 (train)	Combined Score104.9	10
Task-oriented Dialogue	MultiWOZ Taxi domain 1.0	Combined Score106.3	10
Task-oriented Dialogue	Human Evaluation 400 dialogues	DS Success Rate74	2

Showing 8 of 8 rows

Other info

Code

Follow for update

@wizwand_team Discord