Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

About

Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis. However, due to the randomness in the diffusion process, they often struggle with handling diverse low-level tasks that require details preservation. To overcome this limitation, we present a new Diff-Plugin framework to enable a single pre-trained diffusion model to generate high-fidelity results across a variety of low-level tasks. Specifically, we first propose a lightweight Task-Plugin module with a dual branch design to provide task-specific priors, guiding the diffusion process in preserving image content. We then propose a Plugin-Selector that can automatically select different Task-Plugins based on the text instruction, allowing users to edit images by indicating multiple low-level tasks with natural language. We conduct extensive experiments on 8 low-level vision tasks. The results demonstrate the superiority of Diff-Plugin over existing methods, particularly in real-world scenarios. Our ablations further validate that Diff-Plugin is stable, schedulable, and supports robust training across different dataset sizes.

Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W.H. Lau• 2024

Related benchmarks

TaskDatasetResultRank
DehazingRESIDE
FID34.68
25
DeblurringRealBlur-J
FID51.81
17
Derainingreal (test)
FID50.55
17
DesnowingRealistic
FID34.3
17
Image DeblurringRWBI (test)
NIQE5.988
17
Face RestorationLFW
FID20.07
11
Low-light enhancementmerged low
FID48.98
11
DemoireingLCDMoire
FID29.77
11
Highlight RemovalSHIQ
FID12.58
11
Low-light enhancementLow-light enhancement dataset
LPIPS0.227
11
Showing 10 of 16 rows

Other info

Code

Follow for update