Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Efficient Zero-Shot Inpainting with Decoupled Diffusion Guidance

About

Diffusion models have emerged as powerful priors for image editing tasks such as inpainting and local modification, where the objective is to generate realistic content that remains consistent with observed regions. In particular, zero-shot approaches that leverage a pretrained diffusion model, without any retraining, have been shown to achieve highly effective reconstructions. However, state-of-the-art zero-shot methods typically rely on a sequence of surrogate likelihood functions, whose scores are used as proxies for the ideal score. This procedure however requires vector-Jacobian products through the denoiser at every reverse step, introducing significant memory and runtime overhead. To address this issue, we propose a new likelihood surrogate that yields simple and efficient to sample Gaussian posterior transitions, sidestepping the backpropagation through the denoiser network. Our extensive experiments show that our method achieves strong observation consistency compared with fine-tuned baselines and produces coherent, high-quality reconstructions, all while significantly reducing inference cost. Code is available at https://github.com/YazidJanati/ding.

Badr Moufad, Navid Bagheri Shouraki, Alain Oliviero Durmus, Thomas Hirtz, Eric Moulines, Jimmy Olsson, Yazid Janati• 2025

Related benchmarks

TaskDatasetResultRank
Image InpaintingFFHQ DIV2K (val)
Latency (s)2.9
11
InpaintingFFHQ 768 x 768 5k samples
FID (Half)9.6
11
InpaintingDIV2K 768 x 768
FID (Half Crop)39.2
11
Image InpaintingPIE-Bench (556 samples)
FID61.4
11
Showing 4 of 4 rows

Other info

Follow for update