Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

About

Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis. However, due to the randomness in the diffusion process, they often struggle with diverse low-level tasks that require detail preservation. To overcome this limitation, we present a new Diff-Plugin framework that enables a single pre-trained diffusion model to generate high-fidelity results across a variety of low-level tasks. Specifically, we first propose a lightweight Task-Plugin module with a dual-branch design to provide task-specific priors, guiding the diffusion process in preserving image content. We then propose a Plugin-Selector that can automatically select different Task-Plugins based on the text instruction, allowing users to edit images by indicating multiple low-level tasks in natural language. We conduct extensive experiments on 8 low-level vision tasks. The results demonstrate the superiority of Diff-Plugin over existing methods, particularly in real-world scenarios. Our ablations further validate that Diff-Plugin is stable, schedulable, and supports robust training across different dataset sizes.

Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W.H. Lau • 2024

Related benchmarks

Task	Dataset	Metric	Result
Image Deblurring	GoPro (test)	PSNR	22.88	672
Image Deblurring	RealBlur-J (test)	PSNR	25.77	259
Image Deblurring	HIDE (test)	PSNR	21.94	242
Deblurring	RealBlur-R (test)	PSNR	32.64	170
Low-light Image Enhancement	LOL (test)	PSNR	19.836	161
Low-light Image Enhancement	VV (test)	NIQE	3.672	33
Image Restoration	POLED	PSNR	17.01	26
Dehazing	RESIDE	FID	34.68	25
Deblurring	RealBlur-J	FID	51.81	17
Deraining	real (test)	FID	50.55	17

Showing 10 of 28 rows
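
Several rows above report PSNR. For readers unfamiliar with the metric, the following is a minimal sketch of the standard PSNR definition, 10 * log10(MAX^2 / MSE), in decibels. It is not the evaluation code behind the numbers in the table: the function name `psnr`, the flat-list pixel input, and the default peak value of 255 are illustrative assumptions, and this page does not state the color space, cropping, or data range used for the reported results.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Hypothetical helper for illustration; `max_value` is the assumed peak
    pixel intensity (255 for 8-bit images).
    """
    if len(reference) != len(restored) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel values.
    mse = sum((a - b) ** 2 for a, b in zip(reference, restored)) / len(reference)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher PSNR means the restored image is closer to the reference. NIQE and FID, also listed above, are lower-is-better metrics and are computed differently.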
