Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Reparameterized Discrete Diffusion Model for Text Generation

About

This work studies discrete diffusion probabilistic models with applications to natural language generation. We derive an alternative yet equivalent formulation of the sampling from discrete diffusion processes and leverage this insight to develop a family of reparameterized discrete diffusion models. The derived generic framework is highly flexible, offers a fresh perspective of the generation process in discrete diffusion models, and features more effective training and decoding techniques. We conduct extensive experiments to evaluate the text generation capability of our model, demonstrating significant improvements over existing diffusion models.

Lin Zheng, Jianbo Yuan, Lei Yu, Lingpeng Kong• 2023

Related benchmarks

TaskDatasetResultRank
Machine TranslationIWSLT De-En 14
BLEU Score32.14
33
Text SimplificationWikiAuto
BLEU43.86
29
ParaphrasingQQP
BLEU30.83
22
Machine TranslationWMT En-De '14
SacreBLEU26.54
22
Seq2SeqQQP
ROUGE-L59.5
18
Seq2Seq generationQQP
BLEU0.251
17
Question GenerationQT
BLEU16.83
14
Numerical ReasoningCountdown 4
CD487
13
ParaphrasingQQP
Semantic Faithfulness83.93
11
Question GenerationQG
BLEU18.02
8
Showing 10 of 10 rows

Other info

Follow for update