Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise

About

In this paper, we introduce a novel dIffusion language modEl pre-training framework for text generation, which we call GENIE. GENIE is a large-scale pretrained diffusion language model that consists of an encoder and a diffusion-based decoder, which can generate text by gradually transforming a random noise sequence into a coherent text sequence. To pre-train GENIE on a large-scale language corpus, we design a new continuous paragraph denoise objective, which encourages the diffusion-decoder to reconstruct a clean text paragraph from a corrupted version, while preserving the semantic and syntactic coherence. We evaluate GENIE on four downstream text generation benchmarks, namely XSum, CNN/DailyMail, Gigaword, and CommonGen. Our experimental results show that GENIE achieves comparable performance with the state-of-the-art autoregressive models on these benchmarks, and generates more diverse text samples. The code and models of GENIE are available at https://github.com/microsoft/ProphetNet/tree/master/GENIE.

Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Nan Duan, Weizhu Chen• 2022

Related benchmarks

Task	Dataset	Result
Summarization	XSum (test)	ROUGE-219.1	276
Machine Translation	IWSLT De-En 14	BLEU Score29.45	35
Machine Translation	IWSLT En-De 14	SacreBLEU23.89	22
Text Generation	Xsum	ROUGE-129.3	12
Code Generation	Conala	CodeBS66.35	4

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord