Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fast Sampling of Diffusion Models via Operator Learning

About

Diffusion models have found widespread adoption in various areas. However, their sampling process is slow because it requires hundreds to thousands of network evaluations to emulate a continuous process defined by differential equations. In this work, we use neural operators, an efficient method to solve the probability flow differential equations, to accelerate the sampling process of diffusion models. Compared to other fast sampling methods that have a sequential nature, we are the first to propose a parallel decoding method that generates images with only one model forward pass. We propose diffusion model sampling with neural operator (DSNO) that maps the initial condition, i.e., Gaussian distribution, to the continuous-time solution trajectory of the reverse diffusion process. To model the temporal correlations along the trajectory, we introduce temporal convolution layers that are parameterized in the Fourier space into the given diffusion model backbone. We show our method achieves state-of-the-art FID of 3.78 for CIFAR-10 and 7.83 for ImageNet-64 in the one-model-evaluation setting.

Hongkai Zheng, Weili Nie, Arash Vahdat, Kamyar Azizzadenesheli, Anima Anandkumar• 2022

Related benchmarks

TaskDatasetResultRank
Unconditional Image GenerationCIFAR-10
FID3.78
240
Unconditional Image GenerationCIFAR-10 (test)
FID3.78
223
Image GenerationCIFAR-10
FID3.78
203
Unconditional Image GenerationCIFAR-10 unconditional
FID3.78
165
Class-conditional Image GenerationImageNet 64x64
FID7.83
156
Image GenerationImageNet 64x64 resolution (test)
FID7.83
150
Unconditional Image GenerationCIFAR-10 32x32 (test)
FID3.78
137
Class-conditional Image GenerationImageNet 64x64 (test)
FID7.83
91
Image GenerationImageNet 64x64 (val)
FID7.83
48
Unconditional Image GenerationCIFAR-10 32x32 unconditional (test)
FID3.78
33
Showing 10 of 11 rows

Other info

Follow for update