Joint Generator-Ranker Learning for Natural Language Generation

About

Generate-then-rank is a widely used mechanism for text generation, where a generator produces multiple text candidates and a ranker chooses the best one among the text candidates. However, existing methods usually train the generator and the ranker individually, neglecting the mutual feedback that could further enhance the generation quality. To tackle this limitation, we propose JGR, a novel joint training algorithm that integrates the generator and the ranker in a single framework. JGR optimizes the generator with a hybrid objective that combines data likelihood and ranker reward, and trains the ranker with a contrastive loss that compares the generator outputs. By iteratively updating the generator and the ranker, JGR can effectively harmonize their learning and enhance their quality jointly. We evaluate JGR on various text generation tasks and demonstrate that it surpasses existing methods on four public datasets across three common generation scenarios. Our code and models are publicly available at https://github.com/microsoft/ProphetNet/tree/master/JGR.

Weizhou Shen, Yeyun Gong, Yelong Shen, Song Wang, Xiaojun Quan, Nan Duan, Weizhu Chen• 2022

Related benchmarks

Task	Dataset	Result
Abstractive Text Summarization	CNN/Daily Mail (test)	ROUGE-L46.56	169
Dialogue Summarization	SamSum (test)	ROUGE-229.48	80
Summarization	CNN/DailyMail (test)	--	33
Question Generation	SQuAD 1.1 (test)	BLEU-424.73	29
Dialogue Response Generation	Persona-Chat	BLEU-153.3	20

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord