Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Flow Matching for Generative Modeling

About

We introduce a new paradigm for generative modeling built on Continuous Normalizing Flows (CNFs), allowing us to train CNFs at unprecedented scale. Specifically, we present the notion of Flow Matching (FM), a simulation-free approach for training CNFs based on regressing vector fields of fixed conditional probability paths. Flow Matching is compatible with a general family of Gaussian probability paths for transforming between noise and data samples -- which subsumes existing diffusion paths as specific instances. Interestingly, we find that employing FM with diffusion paths results in a more robust and stable alternative for training diffusion models. Furthermore, Flow Matching opens the door to training CNFs with other, non-diffusion probability paths. An instance of particular interest is using Optimal Transport (OT) displacement interpolation to define the conditional probability paths. These paths are more efficient than diffusion paths, provide faster training and sampling, and result in better generalization. Training CNFs using Flow Matching on ImageNet leads to consistently better performance than alternative diffusion-based methods in terms of both likelihood and sample quality, and allows fast and reliable sample generation using off-the-shelf numerical ODE solvers.

Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, Matt Le• 2022

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationGenEval
GenEval Score77
277
Unconditional Image GenerationCIFAR-10 unconditional
FID6.35
159
Text-to-Audio GenerationAudioCaps (test)
FAD1.22
138
Density EstimationCIFAR-10 (test)
Bits/dim2.99
134
Image GenerationImageNet 64x64
FID13.93
114
Image GenerationCIFAR-10
FID4.35
95
Text-to-Image GenerationDPG-Bench
DPG Score78.31
89
Image GenerationImageNet-1K 256x256 (val)--
85
Image GenerationCIFAR-10 (train/test)
FID6.35
78
Density EstimationImageNet 32x32 (test)
Bits per Sub-pixel3.53
66
Showing 10 of 103 rows
...

Other info

Follow for update