
A Distributional Approach to Controlled Text Generation

About

We propose a Distributional Approach to Controlled Text Generation from pre-trained Language Models (LMs). This approach makes it possible to specify, in a single formal framework, both "pointwise" and "distributional" constraints over the target LM (to our knowledge, the first model with such generality) while minimizing KL divergence from the initial LM distribution. The optimal target distribution is then uniquely determined as an explicit EBM (Energy-Based Model) representation. From that optimal representation we then train a target controlled Autoregressive LM through an adaptive distributional variant of Policy Gradient. We conduct a first set of experiments over pointwise constraints showing the advantages of our approach over a set of baselines, in terms of obtaining a controlled LM balancing constraint satisfaction with divergence from the initial LM. We then perform experiments over distributional constraints, a unique feature of our approach, demonstrating its potential as a remedy to the problem of Bias in Language Models. Through an ablation study, we show the effectiveness of our adaptive technique for obtaining faster convergence. (Code available at https://github.com/naver/gdc)
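To make the first step of the abstract concrete, here is a minimal sketch of a distributional constraint on a toy three-word "language model". It relies on the standard result that the distribution closest in KL divergence to a base distribution a(x), subject to a moment constraint E[phi(x)] = mu, has the exponential-family form p(x) proportional to a(x) * exp(lambda * phi(x)). All names here (`tilt`, `solve_lambda`, the toy vocabulary and probabilities) are hypothetical illustrations, not the API of the linked repository, and the sketch does not cover the later step of training an autoregressive LM with policy gradient.

```python
import math

def tilt(base, feature, lam):
    """Return p(x) proportional to base[x] * exp(lam * feature[x])."""
    weights = {x: base[x] * math.exp(lam * feature[x]) for x in base}
    z = sum(weights.values())  # partition function of the toy EBM
    return {x: w / z for x, w in weights.items()}

def expected_feature(dist, feature):
    """Expectation of the feature under a finite distribution."""
    return sum(dist[x] * feature[x] for x in dist)

def solve_lambda(base, feature, target, lo=-50.0, hi=50.0, iters=200):
    """Bisection on lam; the expected feature is non-decreasing in lam."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if expected_feature(tilt(base, feature, mid), feature) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def kl(p, q):
    """KL(p || q) for finite distributions with shared support."""
    return sum(p[x] * math.log(p[x] / q[x]) for x in p if p[x] > 0)

# Toy base distribution (assumed values, for illustration only).
base = {"she": 0.1, "he": 0.6, "they": 0.3}
# Binary feature: 1 if the sample is "she", else 0.
feature = {"she": 1.0, "he": 0.0, "they": 0.0}

# Distributional constraint: "she" should appear half of the time.
lam = solve_lambda(base, feature, target=0.5)
target = tilt(base, feature, lam)

print("lambda:", lam)
print("target distribution:", target)
print("KL(target || base):", kl(target, base))
```

In this sketch the relative proportions of the unconstrained words ("he" and "they") are preserved, which is what minimal divergence from the base distribution implies here. A pointwise constraint (the feature must hold for every sample) corresponds to the limit where the moment target is 1, which amounts to filtering the base distribution by the binary feature rather than solving for a finite lambda.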

Muhammad Khalifa, Hady Elsahar, Marc Dymetman • 2020

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Controllable Language Generation | -ve Sentiment Pointwise Constraint | Dist-3: 0.94 | 17 |
| Controllable Language Generation | Word WikiLeaks Pointwise Constraint | Ctrl Score: 0.75 | 5 |
| Controllable Language Generation | Wordlist Politics Pointwise Constraint | Ctrl: 0.58 | 5 |
| Controllable Language Generation | Word Amazing Pointwise Constraint | Control Score: 0.65 | 5 |
| Controllable Language Generation | Wordlist Science Pointwise Constraint | Ctrl Score: 52 | 5 |
| Controllable Language Generation | +ve Sentiment Pointwise Constraint | Control Success Rate: 59 | 5 |
| Controllable Language Generation | Hybrid Sports Distributional Constraint | Ctrl Score: 87 | 3 |
| Controllable Language Generation | Single Distributional Constraint | Ctrl: 0.8 | 3 |
| Controllable Language Generation | Multiple Distributional Constraint | Ctrl: 0.92 | 3 |
| Controllable Language Generation | Hybrid Science Distributional Constraint | Ctrl: 0.68 | 3 |