A Distributional Approach to Controlled Text Generation
About
We propose a Distributional Approach for addressing Controlled Text Generation from pre-trained Language Models (LMs). This approach permits to specify, in a single formal framework, both "pointwise" and "distributional" constraints over the target LM -- to our knowledge, the first model with such generality -- while minimizing KL divergence from the initial LM distribution. The optimal target distribution is then uniquely determined as an explicit EBM (Energy-Based Model) representation. From that optimal representation we then train a target controlled Autoregressive LM through an adaptive distributional variant of Policy Gradient. We conduct a first set of experiments over pointwise constraints showing the advantages of our approach over a set of baselines, in terms of obtaining a controlled LM balancing constraint satisfaction with divergence from the initial LM. We then perform experiments over distributional constraints, a unique feature of our approach, demonstrating its potential as a remedy to the problem of Bias in Language Models. Through an ablation study, we show the effectiveness of our adaptive technique for obtaining faster convergence. (Code available at https://github.com/naver/gdc)
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Controllable Language Generation | -ve Sentiment Pointwise Constraint | Dist-30.94 | 17 | |
| Controllable Language Generation | Word WikiLeaks Pointwise Constraint | Ctrl Score0.75 | 5 | |
| Controllable Language Generation | Wordlist Politics Pointwise Constraint | Ctrl0.58 | 5 | |
| Controllable Language Generation | Word Amazing Pointwise Constraint | Control Score0.65 | 5 | |
| Controllable Language Generation | Wordlist Science Pointwise Constraint | Ctrl Score52 | 5 | |
| Controllable Language Generation | +ve Sentiment Pointwise Constraint | Control Success Rate59 | 5 | |
| Controllable Language Generation | Hybrid Sports Distributional Constraint | Ctrl Score87 | 3 | |
| Controllable Language Generation | Single Distributional Constraint | Ctrl0.8 | 3 | |
| Controllable Language Generation | Multiple Distributional Constraint | Ctrl0.92 | 3 | |
| Controllable Language Generation | Hybrid Science Distributional Constraint | Ctrl0.68 | 3 |