A Simple Language Model for Task-Oriented Dialogue

About

Task-oriented dialogue is often decomposed into three tasks: understanding user input, deciding actions, and generating a response. While such decomposition might suggest a dedicated model for each sub-task, we find a simple, unified approach leads to state-of-the-art performance on the MultiWOZ dataset. SimpleTOD is a simple approach to task-oriented dialogue that uses a single, causal language model trained on all sub-tasks recast as a single sequence prediction problem. This allows SimpleTOD to fully leverage transfer learning from pre-trained, open domain, causal language models such as GPT-2. SimpleTOD improves over the prior state-of-the-art in joint goal accuracy for dialogue state tracking, and our analysis reveals robustness to noisy annotations in this setting. SimpleTOD also improves the main metrics used to evaluate action decisions and response generation in an end-to-end setting: inform rate by 8.1 points, success rate by 9.7 points, and combined score by 7.2 points.

Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher• 2020

Related benchmarks

Task	Dataset	Result
Dialogue State Tracking	MultiWOZ 2.1 (test)	Joint Goal Accuracy56.45	105
Dialogue State Tracking	MultiWOZ 2.2 (test)	Joint Goal Accuracy54.02	98
Dialog State Tracking	MultiWOZ 2.1 (test)	Joint Goal Accuracy57.47	88
End-to-end task-oriented dialogue	MultiWOZ (test)	Task Success Rate70.1	68
End-to-end task-oriented dialogue	MultiWOZ 2.1 (test)	BLEU Score16.22	57
Dialog State Tracking	MultiWOZ 2.0 (test)	Joint Goal Accuracy51.37	47
Dialogue State Tracking	MultiWOZ 2.1	Joint Goal Accuracy55.7	46
Task-oriented Dialogue	MultiWOZ 2.0 (test)	Inform Rate84.4	37
Task-oriented Dialogue Response Generation	Multi-WOZ 2.1 (test)	BLEU16.01	22
End-to-end task-oriented dialogue	MultiWOZ 2.0 (test)	Inform Accuracy88.9	22

Showing 10 of 27 rows

Other info

Code

Follow for update

@wizwand_team Discord