Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents

About

We introduce dodecaDialogue: a set of 12 tasks that measures if a conversational agent can communicate engagingly with personality and empathy, ask questions, answer questions by utilizing knowledge resources, discuss topics and situations, and perceive and converse about images. By multi-tasking on such a broad large-scale set of data, we hope to both move towards and measure progress in producing a single unified agent that can perceive, reason and converse with humans in an open-domain setting. We show that such multi-tasking improves over a BERT pre-trained baseline, largely due to multi-tasking with very large dialogue datasets in a similar domain, and that the multi-tasking in general provides gains to both text and image-based tasks using several metrics in both the fine-tune and task transfer settings. We obtain state-of-the-art results on many of the tasks, providing a strong baseline for this challenge.

Kurt Shuster, Da Ju, Stephen Roller, Emily Dinan, Y-Lan Boureau, Jason Weston• 2019

Related benchmarks

TaskDatasetResultRank
Knowledge-Grounded Dialogue GenerationWizard of Wikipedia (WoW) Seen (test)--
10
Image-Response GenerationImage-Chat
Win Rate39
6
Image-Grounded Dialogue GenerationImage-Chat (IC) (test)
F1 Score12.9
5
Dialogue GenerationConvAI2 (val)
F1 Score21.7
4
Dialogue GenerationEmpatheticDialogues (ED) (test)
F1 Score19.3
4
Showing 5 of 5 rows

Other info

Follow for update