Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bespoke Solvers for Generative Flow Models

About

Diffusion or flow-based models are powerful generative paradigms that are notoriously hard to sample as samples are defined as solutions to high-dimensional Ordinary or Stochastic Differential Equations (ODEs/SDEs) which require a large Number of Function Evaluations (NFE) to approximate well. Existing methods to alleviate the costly sampling process include model distillation and designing dedicated ODE solvers. However, distillation is costly to train and sometimes can deteriorate quality, while dedicated solvers still require relatively large NFE to produce high quality samples. In this paper we introduce "Bespoke solvers", a novel framework for constructing custom ODE solvers tailored to the ODE of a given pre-trained flow model. Our approach optimizes an order consistent and parameter-efficient solver (e.g., with 80 learnable parameters), is trained for roughly 1% of the GPU time required for training the pre-trained model, and significantly improves approximation and generation quality compared to dedicated solvers. For example, a Bespoke solver for a CIFAR10 model produces samples with Fr\'echet Inception Distance (FID) of 2.73 with 10 NFE, and gets to 1% of the Ground Truth (GT) FID (2.59) for this model with only 20 NFE. On the more challenging ImageNet-64$\times$64, Bespoke samples at 2.2 FID with 10 NFE, and gets within 2% of GT FID (1.71) with 20 NFE.

Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman• 2023

Related benchmarks

TaskDatasetResultRank
Unconditional Layout GenerationRico
FID21.54
55
Image GenerationImageNet 256x256 with FlowDCN (val)
FID7.66
48
Image GenerationMS-COCO 512x512 with Stable Diffusion (val)
FID13.26
48
Image GenerationCIFAR-10 32x32 with ReFlow (test)
FID9.25
48
Unconditional Image GenerationCIFAR-10 unconditional 32x32
FID2.73
17
Showing 5 of 5 rows

Other info

Follow for update