A Diversity-Promoting Objective Function for Neural Conversation Models

About

Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., "I don't know") regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message), is unsuited to response generation tasks. Instead, we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.
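To make the contrast between the two objectives concrete, here is a minimal sketch. It relies on the general definition of pointwise mutual information between a message S and a response T, log p(S,T) / (p(S) p(T)), which for a fixed S ranks responses by log p(T|S) - log p(T). The weight `lam`, the candidate responses, and all log-probability values are hypothetical illustrations, not numbers or code from the paper, and the authors' exact formulation may differ.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    text: str
    logp_given_msg: float  # log p(T | S); hypothetical value
    logp_prior: float      # log p(T); hypothetical value


def likelihood_score(c: Candidate) -> float:
    """Traditional objective: likelihood of the response given the message."""
    return c.logp_given_msg


def mmi_score(c: Candidate, lam: float = 0.5) -> float:
    """Mutual-information-style score: subtract a weighted log-prior so that
    responses that are likely under any message are penalized.
    `lam` is an assumed weighting parameter; lam = 0 recovers the likelihood."""
    return c.logp_given_msg - lam * c.logp_prior


# Made-up candidates: a generic reply has a high prior probability,
# a specific reply has a low one.
CANDIDATES = [
    Candidate("I don't know.", logp_given_msg=-2.0, logp_prior=-1.5),
    Candidate("I moved here from Boston last year.", logp_given_msg=-3.0, logp_prior=-9.0),
]

best_by_likelihood = max(CANDIDATES, key=likelihood_score).text
best_by_mmi = max(CANDIDATES, key=mmi_score).text

print("likelihood picks:", best_by_likelihood)
print("MMI-style picks: ", best_by_mmi)
```

With these toy numbers the likelihood objective selects the generic reply, while the mutual-information-style score selects the more specific one, because the generic reply's high prior probability counts against it.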

Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan • 2015

Related benchmarks

Task	Dataset	Result
prompt_gen	ConTest 200 with_hds	Spearman Rho 0.573	12
Conversation Evaluation	CRSArena-Eval Turn-level	Pearson Correlation (r) 0.716	9
Conversation Evaluation	CRSArena-Eval Dial-level	Pearson r 0.665	9
Conversation Evaluation	CRSArena-Eval (All)	Pearson Correlation (r) 0.68	9
System ranking correlation	CRSArena-Eval Turn-level	Pearson Correlation (r) 0.716	9
System ranking correlation	CRSArena-Eval Dial-level	Pearson Correlation (r) 0.665	9
System ranking correlation	CRSArena-Eval (All)	Pearson Correlation (r) 0.68	9
LLM Generator Selection	33 task-language combinations (Human-annotated data) (test)	Top-1 Match Rate 15.1515	9
Prompt Generation	DecTest prompt_gen 1000 samples no_hds	Spearman Rho 0.917	7
Dialogue Response Generation	Dialogue Dataset (test)	Adversarial Success 9	7

