Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Joint Generator-Ranker Learning for Natural Language Generation

About

Generate-then-rank is a widely used mechanism for text generation, where a generator produces multiple text candidates and a ranker chooses the best one among the text candidates. However, existing methods usually train the generator and the ranker individually, neglecting the mutual feedback that could further enhance the generation quality. To tackle this limitation, we propose JGR, a novel joint training algorithm that integrates the generator and the ranker in a single framework. JGR optimizes the generator with a hybrid objective that combines data likelihood and ranker reward, and trains the ranker with a contrastive loss that compares the generator outputs. By iteratively updating the generator and the ranker, JGR can effectively harmonize their learning and enhance their quality jointly. We evaluate JGR on various text generation tasks and demonstrate that it surpasses existing methods on four public datasets across three common generation scenarios. Our code and models are publicly available at https://github.com/microsoft/ProphetNet/tree/master/JGR.

Weizhou Shen, Yeyun Gong, Yelong Shen, Song Wang, Xiaojun Quan, Nan Duan, Weizhu Chen• 2022

Related benchmarks

TaskDatasetResultRank
Abstractive Text SummarizationCNN/Daily Mail (test)
ROUGE-L46.56
169
Dialogue SummarizationSamSum (test)
ROUGE-229.48
80
Question GenerationSQuAD 1.1 (test)
BLEU-424.73
29
SummarizationCNN/DailyMail (test)--
22
Dialogue Response GenerationPersona-Chat
BLEU-153.3
20
Showing 5 of 5 rows

Other info

Code

Follow for update