Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fast ODE-based Sampling for Diffusion Models in Around 5 Steps

About

Sampling from diffusion models can be treated as solving the corresponding ordinary differential equations (ODEs), with the aim of obtaining an accurate solution with as few number of function evaluations (NFE) as possible. Recently, various fast samplers utilizing higher-order ODE solvers have emerged and achieved better performance than the initial first-order one. However, these numerical methods inherently result in certain approximation errors, which significantly degrades sample quality with extremely small NFE (e.g., around 5). In contrast, based on the geometric observation that each sampling trajectory almost lies in a two-dimensional subspace embedded in the ambient space, we propose Approximate MEan-Direction Solver (AMED-Solver) that eliminates truncation errors by directly learning the mean direction for fast diffusion sampling. Besides, our method can be easily used as a plugin to further improve existing ODE-based samplers. Extensive experiments on image synthesis with the resolution ranging from 32 to 512 demonstrate the effectiveness of our method. With only 5 NFE, we achieve 6.61 FID on CIFAR-10, 10.74 FID on ImageNet 64$\times$64, and 13.20 FID on LSUN Bedroom. Our code is available at https://github.com/zju-pi/diff-sampler.

Zhenyu Zhou, Defang Chen, Can Wang, Chun Chen• 2023

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10
FID4.36
203
Text-to-Image GenerationMS-COCO (val)
FID14.84
202
Class-conditional Image GenerationImageNet 64x64
FID6.66
156
Image GenerationCIFAR-10 32x32
FID4.36
147
Unconditional Image GenerationCIFAR-10 32x32 (test)
FID2.63
137
Image GenerationLSUN bedroom
FID4.19
105
Image GenerationImageNet 64
FID6.66
100
Image GenerationFFHQ
FID6.64
91
Conditional Image GenerationImageNet 64x64 (val)
FID5.44
87
Image GenerationFFHQ 64x64
FID6.64
76
Showing 10 of 24 rows

Other info

Follow for update