Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VIPaint: Image Inpainting with Pre-Trained Diffusion Models via Variational Inference

About

Diffusion probabilistic models learn to remove noise added during training, generating novel data (e.g., images) from Gaussian noise through sequential denoising. However, conditioning the generative process on corrupted or masked images is challenging. While various methods have been proposed for inpainting masked images with diffusion priors, they often fail to produce samples from the true conditional distribution, especially for large masked regions. Many baselines also cannot be applied to latent diffusion models which generate high-quality images with much lower computational cost. We propose a hierarchical variational inference algorithm that optimizes a non-Gaussian Markov approximation of the true diffusion posterior. Our VIPaint method outperforms existing approaches to inpainting, producing diverse high-quality imputations even for state-of-the-art text-conditioned latent diffusion models, and is also effective for other inverse problems like deblurring and superresolution.

Sakshi Agarwal, Gabriel Hope, Jimin Heo, Erik B. Sudderth• 2024

Related benchmarks

TaskDatasetResultRank
Super-ResolutionImageNet 256
PSNR18.9
50
Image InpaintingImageNet 64x64 (test)
PSNR13.33
16
Image InpaintingImageNet64 Random Mask (test)
PSNR13.33
8
Image InpaintingImageNet64 Rotated Window Mask (test)
PSNR9.24
8
Image InpaintingLSUN Churches 256 Random Mask
LPIPS0.44
7
Gaussian deblurImageNet 64
LPIPS0.31
6
Image InpaintingImageNet256 Rotated Window
PSNR9.43
5
Image InpaintingImageNet256 Random Mask
PSNR10.04
5
Image InpaintingLSUN Churches 256 Rotated Window
PSNR8.39
5
Image InpaintingLSUN-Churches256 Small Mask
PSNR16.18
5
Showing 10 of 13 rows

Other info

Follow for update