The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents

About

We introduce dodecaDialogue: a set of 12 tasks that measures if a conversational agent can communicate engagingly with personality and empathy, ask questions, answer questions by utilizing knowledge resources, discuss topics and situations, and perceive and converse about images. By multi-tasking on such a broad large-scale set of data, we hope to both move towards and measure progress in producing a single unified agent that can perceive, reason and converse with humans in an open-domain setting. We show that such multi-tasking improves over a BERT pre-trained baseline, largely due to multi-tasking with very large dialogue datasets in a similar domain, and that the multi-tasking in general provides gains to both text and image-based tasks using several metrics in both the fine-tune and task transfer settings. We obtain state-of-the-art results on many of the tasks, providing a strong baseline for this challenge.

Kurt Shuster, Da Ju, Stephen Roller, Emily Dinan, Y-Lan Boureau, Jason Weston• 2019

Related benchmarks

Task	Dataset	Result
Knowledge-Grounded Dialogue Generation	Wizard of Wikipedia (WoW) Seen (test)	--	10
Image-Response Generation	Image-Chat	Win Rate39	6
Image-Grounded Dialogue Generation	Image-Chat (IC) (test)	F1 Score12.9	5
Dialogue Generation	ConvAI2 (val)	F1 Score21.7	4
Dialogue Generation	EmpatheticDialogues (ED) (test)	F1 Score19.3	4

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord