Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting

About

Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation tasks. Current fine-tuning methods focus on parameter-efficient transfer learning but overlook the fundamental transfer characteristics of diffusion models. In this paper, we investigate the transferability of diffusion models and observe a monotonous chain of forgetting trend of transferability along the reverse process. Based on this observation and novel theoretical insights, we present Diff-Tuning, a frustratingly simple transfer approach that leverages the chain of forgetting tendency. Diff-Tuning encourages the fine-tuned model to retain the pre-trained knowledge at the end of the denoising chain close to the generated data while discarding the other noise side. We conduct comprehensive experiments to evaluate Diff-Tuning, including the transfer of pre-trained Diffusion Transformer models to eight downstream generations and the adaptation of Stable Diffusion to five control conditions with ControlNet. Diff-Tuning achieves a 26% improvement over standard fine-tuning and enhances the convergence speed of ControlNet by 24%. Notably, parameter-efficient transfer learning techniques for diffusion models can also benefit from Diff-Tuning.

Jincheng Zhong, Xingzhuo Guo, Jiaxiang Dong, Mingsheng Long• 2024

Related benchmarks

Task	Dataset	Result
Image Generation	Oxford Flowers	FID12.64	15
Class-conditioned generation	Food101	FID6.05	7
Class-conditioned generation	SUN397	FID7.21	7
Class-conditioned generation	DF20 Mini	FID12.57	7
Class-conditioned generation	Caltech101	FID23.79	7
Class-conditioned generation	CUB-200 2011	FID3.5	7
Class-conditioned generation	ArtBench-10	FID13.85	7
Class-conditioned generation	Stanford Cars	FID5.37	7
Class-conditioned generation	Average of 8 downstream tasks	FID10.63	7
Controllable Image Generation	Sketch	Sudden Convergence Steps3.2	2

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord