FUDGE: Controlled Text Generation With Future Discriminators
About
We propose Future Discriminators for Generation (FUDGE), a flexible and modular method for controlled text generation. Given a pre-existing model G for generating text from a distribution of interest, FUDGE enables conditioning on a desired attribute a (for example, formality) while requiring access only to G's output logits. FUDGE learns an attribute predictor operating on a partial sequence, and uses this predictor's outputs to adjust G's original probabilities. We show that FUDGE models terms corresponding to a Bayesian decomposition of the conditional distribution of G given attribute a. Moreover, FUDGE can easily compose predictors for multiple desired attributes. We evaluate FUDGE on three tasks -- couplet completion in poetry, topic control in language generation, and formality change in machine translation -- and observe gains in all three tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Toxicity Mitigation | ToxicRandom | Relevance42.7 | 30 | |
| Toxicity Mitigation | Toxicity Mitigation Task | Generation Speed (s/item)5.82 | 30 | |
| Sentiment transformation | Sentiment Transformation Task | Generation Speed (s/item)6.43 | 30 | |
| Sentiment transformation | IMDB NegToPos 1.0 (test) | Relevance0.411 | 30 | |
| Sentiment transformation | IMDB PosToNeg 1.0 (test) | Relevance0.415 | 30 | |
| Toxicity Mitigation | ToxicTop | Relevance0.36 | 30 | |
| Controllable Text Generation | Yelp (test) | Perplexity (PPL)10.3 | 20 | |
| Toxicity Mitigation | RealToxicityPrompts (test) | Full Toxicity74.4 | 14 | |
| Attribute-Controlled Dialogue Generation | DailyDialog-CG (test) | Emotion Accuracy (E-ACC)60.1 | 12 | |
| Multi-attribute Controlled Text Generation | CompM-CTG (Hold-Out) | Dist-3 (i.d.)0.652 | 10 |