Fast Sampling of Diffusion Models with Exponential Integrator

About

The past few years have witnessed the great success of Diffusion models~(DMs) in generating high-fidelity samples in generative modeling tasks. A major limitation of the DM is its notoriously slow sampling procedure which normally requires hundreds to thousands of time discretization steps of the learned diffusion process to reach the desired accuracy. Our goal is to develop a fast sampling method for DMs with a much less number of steps while retaining high sample quality. To this end, we systematically analyze the sampling procedure in DMs and identify key factors that affect the sample quality, among which the method of discretization is most crucial. By carefully examining the learned diffusion process, we propose Diffusion Exponential Integrator Sampler~(DEIS). It is based on the Exponential Integrator designed for discretizing ordinary differential equations (ODEs) and leverages a semilinear structure of the learned diffusion process to reduce the discretization error. The proposed method can be applied to any DMs and can generate high-fidelity samples in as few as 10 steps. In our experiments, it takes about 3 minutes on one A6000 GPU to generate $50k$ images from CIFAR10. Moreover, by directly using pre-trained DMs, we achieve the state-of-art sampling performance when the number of score function evaluation~(NFE) is limited, e.g., 4.17 FID with 10 NFEs, 3.37 FID, and 9.74 IS with only 15 NFEs on CIFAR10. Code is available at https://github.com/qsh-zh/deis

Qinsheng Zhang, Yongxin Chen• 2022

Related benchmarks

Task	Dataset	Result
Image Generation	CIFAR-10 (test)	FID4.17	536
Image Generation	CIFAR-10	FID7.05	212
Image Generation	CIFAR-10 32x32	FID3.69	151
Image Generation	ImageNet 64x64 resolution (test)	FID3.1	150
Image Generation	ImageNet 64	FID10.6	109
Image Generation	LSUN bedroom	FID20.71	105
Image Generation	CelebA	FID6.95	96
Image Generation	CIFAR-10 discrete-time and continuous-time models (test)	FID2.57	92
Image Generation	LSUN-Bedroom 256 latent space	FID4.39	90
Image Generation	Imagenet-256 latent space	FID5.35	90

Showing 10 of 21 rows

Other info

Follow for update

@wizwand_team Discord