A Diversity-Promoting Objective Function for Neural Conversation Models
About
Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., "I don't know") regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message) is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan• 2015
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| prompt_gen | ConTest 200 with_hds | Spearman Rho0.573 | 12 | |
| System ranking correlation | CRSArena-Eval Turn-level | Pearson Correlation (r)0.716 | 9 | |
| System ranking correlation | CRSArena-Eval Dial-level | Pearson Correlation (r)0.665 | 9 | |
| System ranking correlation | CRSArena-Eval (All) | Pearson Correlation (r)0.68 | 9 | |
| Prompt Generation | DecTest prompt_gen 1000 samples no_hds | Spearman Rho0.917 | 7 | |
| Dialogue Response Generation | Dialogue Dataset (test) | Adversarial Success9 | 7 | |
| Response Generation | DecTest resp_gen no_hds (1000 samples) | Spearman ρ0.894 | 7 | |
| Story Generation | DecTest story_gen no_hds (1000 samples) | Spearman ρ0.758 | 7 | |
| resp_gen | McDiv full no_hds ~2K | Spearman rho0.517 | 6 | |
| story_gen | McDiv full no_hds ~2K | Spearman rho0.535 | 6 |
Showing 10 of 15 rows