A Reparameterized Discrete Diffusion Model for Text Generation
About
This work studies discrete diffusion probabilistic models with applications to natural language generation. We derive an alternative yet equivalent formulation of the sampling from discrete diffusion processes and leverage this insight to develop a family of reparameterized discrete diffusion models. The derived generic framework is highly flexible, offers a fresh perspective of the generation process in discrete diffusion models, and features more effective training and decoding techniques. We conduct extensive experiments to evaluate the text generation capability of our model, demonstrating significant improvements over existing diffusion models.
Lin Zheng, Jianbo Yuan, Lei Yu, Lingpeng Kong• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine Translation | IWSLT De-En 14 | BLEU Score32.14 | 33 | |
| Text Simplification | WikiAuto | BLEU43.86 | 29 | |
| Paraphrasing | QQP | BLEU30.83 | 22 | |
| Machine Translation | WMT En-De '14 | SacreBLEU26.54 | 22 | |
| Seq2Seq | QQP | ROUGE-L59.5 | 18 | |
| Seq2Seq generation | QQP | BLEU0.251 | 17 | |
| Question Generation | QT | BLEU16.83 | 14 | |
| Numerical Reasoning | Countdown 4 | CD487 | 13 | |
| Paraphrasing | QQP | Semantic Faithfulness83.93 | 11 | |
| Question Generation | QG | BLEU18.02 | 8 |
Showing 10 of 10 rows