Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Improving and generalizing flow-based generative models with minibatch optimal transport

About

Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but they have been held back by limitations in their simulation-based maximum likelihood training. We introduce the generalized conditional flow matching (CFM) technique, a family of simulation-free training objectives for CNFs. CFM features a stable regression objective like that used to train the stochastic flow in diffusion models but enjoys the efficient inference of deterministic flow models. In contrast to both diffusion models and prior CNF training algorithms, CFM does not require the source distribution to be Gaussian or require evaluation of its density. A variant of our objective is optimal transport CFM (OT-CFM), which creates simpler flows that are more stable to train and lead to faster inference, as evaluated in our experiments. Furthermore, we show that when the true OT plan is available, our OT-CFM method approximates dynamic OT. Training CNFs with CFM improves results on a variety of conditional and unconditional generation tasks, such as inferring single cell dynamics, unsupervised image translation, and Schr\"odinger bridge inference.

Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Guy Wolf, Yoshua Bengio• 2023

Related benchmarks

TaskDatasetResultRank
Unconditional Image GenerationCIFAR-10
FID3.57
280
Image GenerationCIFAR-10
FID4.4
212
Unconditional Image GenerationCIFAR-10 unconditional
FID4.49
209
Class-conditional Image GenerationImageNet 64x64
FID5.36
170
Image GenerationFFHQ
FID8.5
83
Unconditional Image GenerationCIFAR-10 32 x 32
FID3.68
71
Class-conditional Image GenerationImageNet class-conditional 256x256
Inception Score (IS)247.7
61
Anomaly DetectionSMAP (test)
Precision89.3
35
Unconditional Image GenerationCIFAR10--
33
Intermediate distribution restorationSingle-cell data (intermediate time points ti for i in {1, 2, 3})
W1 Score0.79
28
Showing 10 of 81 rows
...

Other info

Follow for update