Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting

About

Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation tasks. Current fine-tuning methods focus on parameter-efficient transfer learning but overlook the fundamental transfer characteristics of diffusion models. In this paper, we investigate the transferability of diffusion models and observe a monotonous chain of forgetting trend of transferability along the reverse process. Based on this observation and novel theoretical insights, we present Diff-Tuning, a frustratingly simple transfer approach that leverages the chain of forgetting tendency. Diff-Tuning encourages the fine-tuned model to retain the pre-trained knowledge at the end of the denoising chain close to the generated data while discarding the other noise side. We conduct comprehensive experiments to evaluate Diff-Tuning, including the transfer of pre-trained Diffusion Transformer models to eight downstream generations and the adaptation of Stable Diffusion to five control conditions with ControlNet. Diff-Tuning achieves a 26% improvement over standard fine-tuning and enhances the convergence speed of ControlNet by 24%. Notably, parameter-efficient transfer learning techniques for diffusion models can also benefit from Diff-Tuning.

Jincheng Zhong, Xingzhuo Guo, Jiaxiang Dong, Mingsheng Long• 2024

Related benchmarks

TaskDatasetResultRank
Image GenerationOxford Flowers
FID12.64
15
Class-conditioned generationFood101
FID6.05
7
Class-conditioned generationSUN397
FID7.21
7
Class-conditioned generationDF20 Mini
FID12.57
7
Class-conditioned generationCaltech101
FID23.79
7
Class-conditioned generationCUB-200 2011
FID3.5
7
Class-conditioned generationArtBench-10
FID13.85
7
Class-conditioned generationStanford Cars
FID5.37
7
Class-conditioned generationAverage of 8 downstream tasks
FID10.63
7
Controllable Image GenerationSketch
Sudden Convergence Steps3.2
2
Showing 10 of 15 rows

Other info

Follow for update