Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

About

Denoising diffusion probabilistic models (DDPM) have shown remarkable performance in unconditional image generation. However, due to the stochasticity of the generative process in DDPM, it is challenging to generate images with the desired semantics. In this work, we propose Iterative Latent Variable Refinement (ILVR), a method to guide the generative process in DDPM to generate high-quality images based on a given reference image. Here, the refinement of the generative process in DDPM enables a single DDPM to sample images from various sets directed by the reference image. The proposed ILVR method generates high-quality images while controlling the generation. The controllability of our method allows adaptation of a single DDPM without any additional learning in various image generation tasks, such as generation from various downsampling factors, multi-domain image translation, paint-to-image, and editing with scribbles.

Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon• 2021

Related benchmarks

TaskDatasetResultRank
Super-Resolution (4x)ImageNet
PSNR27.4
57
Gaussian DeblurringFFHQ 256x256 (val)
LPIPS0.403
48
Image InpaintingFFHQ 256x256 (val)
FID76.54
42
Motion DeblurringFFHQ 256x256 (val)
FID292.2
19
Super-ResolutionFFHQ 256x256 (val)
LPIPS0.563
19
Super-Resolution (4x)CelebA
PSNR31.59
16
3D InpaintingToys4k Inpainting Part
CLIP Score30.61
14
3D ReconstructionToys4k (Preserved Part)
Appearance PSNR20.62
14
Unpaired Image-to-Image TranslationCat → Dog v1 (test)
FID74.37
14
BlendingMTG-Jamendo (test)
FAD2.696
10
Showing 10 of 24 rows

Other info

Follow for update