Contrastive Decoding: Open-ended Text Generation as Optimization

About

Given a language model (LM), maximum probability is a poor decoding objective for open-ended generation, because it produces short and repetitive text. On the other hand, sampling can often produce incoherent text that drifts from the original topics. We propose contrastive decoding (CD), a reliable decoding approach that optimizes a contrastive objective subject to a plausibility constraint. The contrastive objective returns the difference between the likelihood under a large LM (called the expert, e.g., OPT-13B) and a small LM (called the amateur, e.g., OPT-125M), and the constraint ensures that the outputs are plausible. CD is inspired by the fact that the failures of larger LMs (e.g., repetition, incoherence) are even more prevalent in smaller LMs, and that this difference signals which texts should be preferred. CD requires zero additional training, and produces higher quality text than decoding from the larger LM alone. It also works across model scales (OPT-13B and GPT2-1.5B) and significantly outperforms four strong decoding algorithms (e.g., nucleus, top-k) in automatic and human evaluations across Wikipedia, news, and story domains.
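
As a rough illustration of the objective described above, the sketch below scores candidate next tokens by the difference between expert and amateur log-probabilities, restricted to tokens the expert finds plausible. The abstract does not spell out the form of the plausibility constraint, so the threshold used here (keep tokens whose expert probability is at least `alpha` times the expert's top probability) is an assumption, as are the names `contrastive_scores`, `pick_token`, and `alpha`. Greedy per-token selection is also a simplification and not necessarily the search procedure the authors use.

```python
import math


def contrastive_scores(expert_logprobs, amateur_logprobs, alpha=0.1):
    """Score every candidate token for a single decoding step.

    Both inputs are lists of next-token log-probabilities over the same
    vocabulary. `alpha` is a hypothetical plausibility threshold in (0, 1].
    """
    if len(expert_logprobs) != len(amateur_logprobs):
        raise ValueError("expert and amateur must share one vocabulary")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")

    # Assumed plausibility constraint: keep a token only if its expert
    # probability is at least alpha times the expert's best probability.
    cutoff = math.log(alpha) + max(expert_logprobs)

    scores = []
    for lp_expert, lp_amateur in zip(expert_logprobs, amateur_logprobs):
        if lp_expert >= cutoff:
            # Contrastive objective: expert log-likelihood minus amateur's.
            scores.append(lp_expert - lp_amateur)
        else:
            # Implausible under the expert: never selected.
            scores.append(-math.inf)
    return scores


def pick_token(expert_logprobs, amateur_logprobs, alpha=0.1):
    """Return the index of the highest-scoring plausible token (greedy)."""
    scores = contrastive_scores(expert_logprobs, amateur_logprobs, alpha)
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    # Toy distributions over a 4-token vocabulary (made-up numbers).
    expert = [math.log(p) for p in (0.5, 0.3, 0.19, 0.01)]
    amateur = [math.log(p) for p in (0.7, 0.1, 0.1999, 0.0001)]
    print(pick_token(expert, amateur, alpha=0.1))
```

With these toy numbers, token 0 is the expert's most likely choice but the amateur likes it even more, so it scores below token 1. Token 3 has the largest expert-to-amateur ratio, yet the expert assigns it very low probability, so the constraint masks it out. This is the role the abstract assigns to the plausibility constraint: without it, the contrast alone could reward unlikely tokens.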

Xiang Lisa Li, Ari Holtzman, Daniel Fried, Percy Liang, Jason Eisner, Tatsunori Hashimoto, Luke Zettlemoyer, Mike Lewis • 2022

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Visual Question Answering | VQA v2 | Accuracy 80.47 | 1165 |
| Visual Question Answering | VizWiz | Accuracy 63.29 | 1043 |
| Mathematical Reasoning | GSM8K | Accuracy 89.31 | 983 |
| Visual Question Answering | GQA | Accuracy 72.2 | 963 |
| Code Generation | HumanEval | Pass@1 35.98 | 850 |
| Text-based Visual Question Answering | TextVQA | Accuracy 77.96 | 496 |
| Instruction Following | IFEval | Accuracy (0-100) 31.97 | 292 |
| Code Generation | HumanEval+ | Pass@1 60.98 | 189 |
| Mathematical Reasoning | MATH500 | Accuracy (ACC) 21.4 | 133 |
| Question Answering | SQuAD | F1 83.1 | 127 |
Showing 10 of 44 rows
