Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration

About

The diffusion model, a new generative modeling paradigm, has achieved significant success in generating images, audio, video, and text. It has been adapted for sequence-to-sequence text generation (Seq2Seq) through DiffuSeq, termed S2S Diffusion. Existing S2S-Diffusion models predominantly rely on fixed or hand-crafted rules to schedule noise during the diffusion and denoising processes. However, these models are limited by non-contextualized noise, which fails to fully consider the characteristics of Seq2Seq tasks. In this paper, we propose the Meta-DiffuB framework - a novel scheduler-exploiter S2S-Diffusion paradigm designed to overcome the limitations of existing S2S-Diffusion models. We employ Meta-Exploration to train an additional scheduler model dedicated to scheduling contextualized noise for each sentence. Our exploiter model, an S2S-Diffusion model, leverages the noise scheduled by our scheduler model for updating and generation. Meta-DiffuB achieves state-of-the-art performance compared to previous S2S-Diffusion models and fine-tuned pre-trained language models (PLMs) across four Seq2Seq benchmark datasets. We further investigate and visualize the impact of Meta-DiffuB's noise scheduling on the generation of sentences with varying difficulties. Additionally, our scheduler model can function as a "plug-and-play" model to enhance DiffuSeq without the need for fine-tuning during the inference stage.

Yun-Yen Chuang, Hung-Min Hsu, Kevin Lin, Chen-Sheng Gu, Ling Zhen Li, Ray-I Chang, Hung-yi Lee• 2024

Related benchmarks

TaskDatasetResultRank
ParaphrasingQQP
BLEU26.32
22
Machine TranslationIWSLT14 DE-EN
BLEU Score33.82
22
Seq2Seq generationQQP
BLEU0.3142
17
Question GenerationQT
BLEU18.2
14
Text SimplificationWikiAuto
BLEU39.57
14
Machine TranslationWMT14 DE-EN
SacreBLEU32.09
13
Question GenerationQG
BLEU22.71
8
Seq2Seq generationWA
BLEU0.3877
7
Seq2Seq generationQT
BLEU0.182
7
Seq2Seq generationCC
BLEU2.2
7
Showing 10 of 10 rows

Other info

Code

Follow for update