
Discrete Flow Matching

About

Although Flow Matching and diffusion models have emerged as powerful generative paradigms for continuous variables such as images and videos, their application to high-dimensional discrete data, such as language, remains limited. In this work, we present Discrete Flow Matching, a novel discrete flow paradigm designed specifically for generating discrete data. Discrete Flow Matching offers several key contributions: (i) it works with a general family of probability paths interpolating between source and target distributions; (ii) it allows for a generic formula for sampling from these probability paths using learned posteriors such as the probability denoiser ($x$-prediction) and noise-prediction ($\epsilon$-prediction); (iii) practically, focusing on specific probability paths defined with different schedulers improves generative perplexity compared to previous discrete diffusion and flow models; and (iv) by scaling Discrete Flow Matching models up to 1.7B parameters, we reach 6.7% Pass@1 and 13.4% Pass@10 on HumanEval and 6.7% Pass@1 and 20.6% Pass@10 on 1-shot MBPP coding benchmarks. Our approach is capable of generating high-quality discrete data in a non-autoregressive fashion, significantly closing the gap between autoregressive models and discrete flow models.
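To make the idea of a probability path between a source and a target distribution concrete, the following is a minimal sketch of one simple case: a per-token mixture path in which each position independently holds its target token with probability given by a scheduler value, and otherwise stays at the source. The abstract does not spell out this form, so everything here is an assumption made for illustration: the all-mask source, the `MASK` id, the polynomial scheduler `kappa`, and the function name `sample_path` are hypothetical and are not taken from the authors' code.

```python
import random

MASK = -1  # hypothetical mask-token id standing in for the source distribution


def kappa(t, power=1.0):
    """Hypothetical polynomial scheduler with kappa(0) = 0 and kappa(1) = 1."""
    return t ** power


def sample_path(x1, t, rng, power=1.0):
    """Draw x_t from an assumed per-token mixture path.

    Each position independently takes its target token with probability
    kappa(t) and otherwise remains at the source (mask) token.
    """
    k = kappa(t, power)
    return [tok if rng.random() < k else MASK for tok in x1]


if __name__ == "__main__":
    rng = random.Random(0)
    x1 = [5, 2, 9, 9, 1, 7]  # toy target sequence of token ids
    for t in (0.0, 0.5, 1.0):
        print(t, sample_path(x1, t, rng))
```

At `t = 0` every position is the source token and at `t = 1` the sample equals the target sequence; changing `power` changes how quickly tokens are revealed in between, which is the sense in which different schedulers define different paths in this sketch.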

Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman • 2024

Related benchmarks

Task                  Dataset                             Result                Rank
Text Generation       OpenWebText                         Perplexity 146.5      86
Image Generation      CIFAR-10 (train/test)               FID 3.63              78
Text Generation       WikiText-103                        Perplexity 69.06      23
Molecule Generation   GuacaMol                            Validity 86.6         20
Numerical Reasoning   Countdown 4                         CD4 87.5              13
Molecule Generation   MOSES                               Validity 88.3         11
Molecule Generation   QM9 without H                       Validity 99.3         10
DNA enhancer design   HepG2 (test)                        Pred. Activity 0.64   6
Sudoku Solving        Kaggle Unfiltered (generalization)  Accuracy 44.5         6
Image Generation      CIFAR-10 Color                      FID 36.91             5
Showing 10 of 13 rows
