
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

About

Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis. However, due to the randomness in the diffusion process, they often struggle with handling diverse low-level tasks that require detail preservation. To overcome this limitation, we present a new Diff-Plugin framework to enable a single pre-trained diffusion model to generate high-fidelity results across a variety of low-level tasks. Specifically, we first propose a lightweight Task-Plugin module with a dual-branch design to provide task-specific priors, guiding the diffusion process in preserving image content. We then propose a Plugin-Selector that can automatically select different Task-Plugins based on the text instruction, allowing users to edit images by indicating multiple low-level tasks with natural language. We conduct extensive experiments on 8 low-level vision tasks. The results demonstrate the superiority of Diff-Plugin over existing methods, particularly in real-world scenarios. Our ablations further validate that Diff-Plugin is stable, schedulable, and supports robust training across different dataset sizes.

Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W.H. Lau • 2024

Related benchmarks

Task                          Dataset             Result         Rank
Image Deblurring              GoPro (test)        PSNR 22.88     672
Image Deblurring              RealBlur-J (test)   PSNR 25.77     259
Image Deblurring              HIDE (test)         PSNR 21.94     242
Deblurring                    RealBlur-R (test)   PSNR 32.64     170
Low-light Image Enhancement   LOL (test)          PSNR 19.836    161
Low-light Image Enhancement   VV (test)           NIQE 3.672     33
Image Restoration             POLED               PSNR 17.01     26
Dehazing                      RESIDE              FID 34.68      25
Deblurring                    RealBlur-J          FID 51.81      17
Deraining                     real (test)         FID 50.55      17
Showing 10 of 28 rows
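
Several of the results above are reported as PSNR (peak signal-to-noise ratio, in dB, where higher is better). As a reading aid, here is a minimal sketch of the standard PSNR definition, 10 * log10(MAX^2 / MSE). The function name `psnr`, the flat-list input, and the default 8-bit range of 255 are assumptions made for illustration; this page does not state the evaluation protocol (color space, cropping, data range) behind the listed numbers, so this sketch is not expected to reproduce them.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    `reference` and `restored` are flat sequences of pixel intensities
    (an illustrative assumption; real pipelines usually work on arrays).
    """
    if len(reference) != len(restored) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    # Mean squared error over all pixels.
    mse = sum((r - x) ** 2 for r, x in zip(reference, restored)) / len(reference)

    if mse == 0:
        return math.inf  # identical inputs: PSNR is unbounded

    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    clean = [100, 100, 100, 100]
    noisy = [110, 90, 110, 90]  # every pixel is off by 10, so MSE = 100
    print(f"PSNR: {psnr(clean, noisy):.2f} dB")
```

The other two metrics in the table, NIQE and FID, are both lower-is-better and are not covered by this sketch.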

Other info

Code