Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FUDGE: Controlled Text Generation With Future Discriminators

About

We propose Future Discriminators for Generation (FUDGE), a flexible and modular method for controlled text generation. Given a pre-existing model G for generating text from a distribution of interest, FUDGE enables conditioning on a desired attribute a (for example, formality) while requiring access only to G's output logits. FUDGE learns an attribute predictor operating on a partial sequence, and uses this predictor's outputs to adjust G's original probabilities. We show that FUDGE models terms corresponding to a Bayesian decomposition of the conditional distribution of G given attribute a. Moreover, FUDGE can easily compose predictors for multiple desired attributes. We evaluate FUDGE on three tasks -- couplet completion in poetry, topic control in language generation, and formality change in machine translation -- and observe gains in all three tasks.

Kevin Yang, Dan Klein• 2021

Related benchmarks

TaskDatasetResultRank
Toxicity MitigationToxicRandom
Relevance42.7
30
Toxicity MitigationToxicity Mitigation Task
Generation Speed (s/item)5.82
30
Sentiment transformationSentiment Transformation Task
Generation Speed (s/item)6.43
30
Sentiment transformationIMDB NegToPos 1.0 (test)
Relevance0.411
30
Sentiment transformationIMDB PosToNeg 1.0 (test)
Relevance0.415
30
Toxicity MitigationToxicTop
Relevance0.36
30
Controllable Text GenerationYelp (test)
Perplexity (PPL)10.3
20
Toxicity MitigationRealToxicityPrompts (test)
Full Toxicity74.4
14
Attribute-Controlled Dialogue GenerationDailyDialog-CG (test)
Emotion Accuracy (E-ACC)60.1
12
Multi-attribute Controlled Text GenerationCompM-CTG (Hold-Out)
Dist-3 (i.d.)0.652
10
Showing 10 of 51 rows

Other info

Follow for update