Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models

About

Due to the ease of training, ability to scale, and high sample quality, diffusion models (DMs) have become the preferred option for generative modeling, with numerous pre-trained models available for a wide variety of datasets. Containing intricate information about data distributions, pre-trained DMs are valuable assets for downstream applications. In this work, we consider learning from pre-trained DMs and transferring their knowledge to other generative models in a data-free fashion. Specifically, we propose a general framework called Diff-Instruct to instruct the training of arbitrary generative models as long as the generated samples are differentiable with respect to the model parameters. Our proposed Diff-Instruct is built on a rigorous mathematical foundation where the instruction process directly corresponds to minimizing a novel divergence we call Integral Kullback-Leibler (IKL) divergence. IKL is tailored for DMs by calculating the integral of the KL divergence along a diffusion process, which we show to be more robust in comparing distributions with misaligned supports. We also reveal non-trivial connections of our method to existing works such as DreamFusion, and generative adversarial training. To demonstrate the effectiveness and universality of Diff-Instruct, we consider two scenarios: distilling pre-trained diffusion models and refining existing GAN models. The experiments on distilling pre-trained diffusion models show that Diff-Instruct results in state-of-the-art single-step diffusion-based models. The experiments on refining GAN models show that the Diff-Instruct can consistently improve the pre-trained generators of GAN models across various settings.

Weijian Luo, Tianyang Hu, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhihua Zhang• 2023

Related benchmarks

Task	Dataset	Result
Unconditional Image Generation	CIFAR-10	FID4.53	280
Unconditional Image Generation	CIFAR-10 (test)	FID4.53	223
Image Generation	CIFAR-10	FID4.19	212
Unconditional Image Generation	CIFAR-10 unconditional	FID2.71	209
Class-conditional Image Generation	ImageNet 64x64	FID5.57	170
Image Generation	ImageNet 64x64 resolution (test)	FID5.57	150
Unconditional Image Generation	CIFAR-10 32x32 (test)	FID4.53	137
Unconditional Generation	CIFAR-10 (test)	FID4.53	102
Conditional Image Generation	CIFAR10 (test)	Fréchet Inception Distance2.27	92
Class-conditional Image Generation	ImageNet 64x64 (test)	FID5.57	91

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord