Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models

About

Due to the ease of training, ability to scale, and high sample quality, diffusion models (DMs) have become the preferred option for generative modeling, with numerous pre-trained models available for a wide variety of datasets. Containing intricate information about data distributions, pre-trained DMs are valuable assets for downstream applications. In this work, we consider learning from pre-trained DMs and transferring their knowledge to other generative models in a data-free fashion. Specifically, we propose a general framework called Diff-Instruct to instruct the training of arbitrary generative models as long as the generated samples are differentiable with respect to the model parameters. Our proposed Diff-Instruct is built on a rigorous mathematical foundation where the instruction process directly corresponds to minimizing a novel divergence we call Integral Kullback-Leibler (IKL) divergence. IKL is tailored for DMs by calculating the integral of the KL divergence along a diffusion process, which we show to be more robust in comparing distributions with misaligned supports. We also reveal non-trivial connections of our method to existing works such as DreamFusion, and generative adversarial training. To demonstrate the effectiveness and universality of Diff-Instruct, we consider two scenarios: distilling pre-trained diffusion models and refining existing GAN models. The experiments on distilling pre-trained diffusion models show that Diff-Instruct results in state-of-the-art single-step diffusion-based models. The experiments on refining GAN models show that the Diff-Instruct can consistently improve the pre-trained generators of GAN models across various settings.

Weijian Luo, Tianyang Hu, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhihua Zhang• 2023

Related benchmarks

TaskDatasetResultRank
Unconditional Image GenerationCIFAR-10 (test)
FID4.53
216
Unconditional Image GenerationCIFAR-10
FID4.53
171
Unconditional Image GenerationCIFAR-10 unconditional
FID2.71
159
Image GenerationImageNet 64x64 resolution (test)
FID5.57
150
Class-conditional Image GenerationImageNet 64x64
FID5.57
126
Unconditional GenerationCIFAR-10 (test)
FID4.53
102
Image GenerationCIFAR-10
FID4.19
95
Unconditional Image GenerationCIFAR-10 32x32 (test)
FID4.53
94
Class-conditional Image GenerationImageNet 64x64 (test)
FID5.57
86
Conditional Image GenerationCIFAR-10
FID2.27
71
Showing 10 of 17 rows

Other info

Code

Follow for update