Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Flow Matching for Generative Modeling

About

We introduce a new paradigm for generative modeling built on Continuous Normalizing Flows (CNFs), allowing us to train CNFs at unprecedented scale. Specifically, we present the notion of Flow Matching (FM), a simulation-free approach for training CNFs based on regressing vector fields of fixed conditional probability paths. Flow Matching is compatible with a general family of Gaussian probability paths for transforming between noise and data samples -- which subsumes existing diffusion paths as specific instances. Interestingly, we find that employing FM with diffusion paths results in a more robust and stable alternative for training diffusion models. Furthermore, Flow Matching opens the door to training CNFs with other, non-diffusion probability paths. An instance of particular interest is using Optimal Transport (OT) displacement interpolation to define the conditional probability paths. These paths are more efficient than diffusion paths, provide faster training and sampling, and result in better generalization. Training CNFs using Flow Matching on ImageNet leads to consistently better performance than alternative diffusion-based methods in terms of both likelihood and sample quality, and allows fast and reliable sample generation using off-the-shelf numerical ODE solvers.

Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, Matt Le• 2022

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationGenEval
Overall Score81
391
Text-to-Image GenerationGenEval
GenEval Score77
360
Text-to-Image GenerationDPG-Bench
Overall Score83.67
265
Unconditional Image GenerationCIFAR-10
FID2.63
240
Image GenerationCIFAR-10
FID4.35
203
Inverse designMEG (test)
MAE1.68
168
Unconditional Image GenerationCIFAR-10 unconditional
FID6.35
165
Text-to-Audio GenerationAudioCaps (test)
FAD1.22
154
Image GenerationCIFAR-10 32x32
FID2.99
147
Density EstimationCIFAR-10 (test)
Bits/dim2.99
134
Showing 10 of 170 rows
...

Other info

Follow for update