
Knowledge Distillation in Iterative Generative Models for Improved Sampling Speed

About

Iterative generative models, such as noise conditional score networks and denoising diffusion probabilistic models, produce high quality samples by gradually denoising an initial noise vector. However, their denoising process has many steps, making them 2-3 orders of magnitude slower than other generative models such as GANs and VAEs. In this paper, we establish a novel connection between knowledge distillation and image generation with a technique that distills a multi-step denoising process into a single step, resulting in a sampling speed similar to other single-step generative models. Our Denoising Student generates high quality samples comparable to GANs on the CIFAR-10 and CelebA datasets, without adversarial training. We demonstrate that our method scales to higher resolutions through experiments on 256 x 256 LSUN. Code and checkpoints are available at https://github.com/tcl9876/Denoising_Student
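The core idea — regressing a single-step student onto the output of a multi-step denoising teacher — can be illustrated with a toy sketch. This is not the paper's implementation: the "teacher" here is a stand-in iterative denoiser on scalars, the student is a linear map, and all names and constants are illustrative. It only shows the distillation loop: sample noise, run the teacher's full denoising trajectory to get a target, and train the student to produce that target in one step.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "teacher": an iterative denoiser that pulls an initial noise
# vector toward a fixed target over many small steps (a stand-in for
# a score/diffusion sampler; TARGET_MEAN and N_STEPS are illustrative).
TARGET_MEAN = 2.0
N_STEPS = 50

def teacher_sample(z):
    x = z.copy()
    for _ in range(N_STEPS):
        x = x + 0.1 * (TARGET_MEAN - x)  # one denoising step
    return x

# "Student": a single affine map x = a*z + b, trained to regress the
# teacher's final output directly from the initial noise z, so that
# sampling takes one step instead of N_STEPS.
a, b = 1.0, 0.0
lr = 0.05
for _ in range(2000):
    z = rng.standard_normal(64)
    y = teacher_sample(z)        # distillation target (50 teacher steps)
    pred = a * z + b             # student output (1 step)
    err = pred - y
    a -= lr * np.mean(err * z)   # SGD on the squared error
    b -= lr * np.mean(err)

# Measure the student/teacher gap on fresh noise.
z = rng.standard_normal(1000)
gap = np.mean((a * z + b - teacher_sample(z)) ** 2)
print(gap)
```

Because the teacher's deterministic map is affine in this toy setting, the student can match it exactly; in the paper the same regression is done with a neural network student on the deterministic (DDIM-style) trajectory of a trained diffusion model.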

Eric Luhman, Troy Luhman • 2021

Related benchmarks

Task                            Dataset                  Result     Rank
Image Generation                CIFAR-10 (test)          FID 9.36   471
Unconditional Image Generation  CIFAR-10 (test)          FID 9.36   216
Unconditional Image Generation  CIFAR-10                 FID 9.36   171
Unconditional Image Generation  CIFAR-10 unconditional   FID 9.36   159
Image Generation                CIFAR10 32x32 (test)     FID 9.36   154
Unconditional Generation        CIFAR-10 (test)          FID 9.36   102
Image Generation                CIFAR-10                 FID 9.36   95
Unconditional Image Generation  CIFAR-10 32x32 (test)    FID 9.36   94
Image Generation                CIFAR10                  FID 3      13
