Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fast Sampling of Diffusion Models with Exponential Integrator

About

The past few years have witnessed the great success of Diffusion models~(DMs) in generating high-fidelity samples in generative modeling tasks. A major limitation of the DM is its notoriously slow sampling procedure which normally requires hundreds to thousands of time discretization steps of the learned diffusion process to reach the desired accuracy. Our goal is to develop a fast sampling method for DMs with a much less number of steps while retaining high sample quality. To this end, we systematically analyze the sampling procedure in DMs and identify key factors that affect the sample quality, among which the method of discretization is most crucial. By carefully examining the learned diffusion process, we propose Diffusion Exponential Integrator Sampler~(DEIS). It is based on the Exponential Integrator designed for discretizing ordinary differential equations (ODEs) and leverages a semilinear structure of the learned diffusion process to reduce the discretization error. The proposed method can be applied to any DMs and can generate high-fidelity samples in as few as 10 steps. In our experiments, it takes about 3 minutes on one A6000 GPU to generate $50k$ images from CIFAR10. Moreover, by directly using pre-trained DMs, we achieve the state-of-art sampling performance when the number of score function evaluation~(NFE) is limited, e.g., 4.17 FID with 10 NFEs, 3.37 FID, and 9.74 IS with only 15 NFEs on CIFAR10. Code is available at https://github.com/qsh-zh/deis

Qinsheng Zhang, Yongxin Chen• 2022

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10 (test)
FID4.17
471
Image GenerationImageNet 64x64 resolution (test)
FID3.1
150
Image GenerationCIFAR-10 discrete-time and continuous-time models (test)
FID2.57
92
Image GenerationCIFAR10 50k samples (test)
FID2.86
81
Image GenerationFFHQ 64x64 (test)
FID45.7
69
Image GenerationImageNet 64x64 (val)
FID3.1
48
Unconditional 3D Point Cloud GenerationShapeNet airplane (val)
MMD0.67
40
Image GenerationCelebA
FID6.95
39
Image GenerationCelebA 64x64 discrete-time model (test)
FID2.95
28
Image GenerationCIFAR-10 32x32 EDM (test)
FID47.68
24
Showing 10 of 11 rows

Other info

Follow for update