Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models

About

Publicly available, large pretrained LanguageModels (LMs) generate text with remarkable quality, but only sequentially from left to right. As a result, they are not immediately applicable to generation tasks that break the unidirectional assumption, such as paraphrasing or text-infilling, necessitating task-specific supervision. In this paper, we present Reflective Decoding, a novel unsupervised algorithm that allows for direct application of unidirectional LMs to non-sequential tasks. Our 2-step approach requires no supervision or even parallel corpora, only two off-the-shelf pretrained LMs in opposite directions: forward and backward. First, in the contextualization step, we use LMs to generate ensembles of past and future contexts which collectively capture the input (e.g. the source sentence for paraphrasing). Second, in the reflection step, we condition on these "context ensembles", generating outputs that are compatible with them. Comprehensive empirical results demonstrate that Reflective Decoding outperforms strong unsupervised baselines on both paraphrasing and abductive text infilling, significantly narrowing the gap between unsupervised and supervised methods. Reflective Decoding surpasses multiple supervised baselines on various metrics including human evaluation.

Peter West, Ximing Lu, Ari Holtzman, Chandra Bhagavatula, Jena Hwang, Yejin Choi• 2020

Related benchmarks

Task	Dataset	Result
Question Answering	ELI5 Wiki-answerable	ROUGE-L Score18.2	14
Question Answering	PubMedQA (out-of-domain)	ROUGE-L9.2	14
Question Answering	MSMARCO Wiki-answerable	ROUGE-L44.7	14
Question Answering	PIQA out-of-domain	ROUGE-L16.7	14
Natural language generation	aNLG	Relevance (Mean)5.27	5
Natural language generation	ELI5	Relevance (Mean)4.73	5
Natural language generation	WoW	Mean Relevance4.35	5

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord