Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

About

Denoising diffusion probabilistic models (DDPM) have shown remarkable performance in unconditional image generation. However, due to the stochasticity of the generative process in DDPM, it is challenging to generate images with the desired semantics. In this work, we propose Iterative Latent Variable Refinement (ILVR), a method to guide the generative process in DDPM to generate high-quality images based on a given reference image. Here, the refinement of the generative process in DDPM enables a single DDPM to sample images from various sets directed by the reference image. The proposed ILVR method generates high-quality images while controlling the generation. The controllability of our method allows adaptation of a single DDPM without any additional learning in various image generation tasks, such as generation from various downsampling factors, multi-domain image translation, paint-to-image, and editing with scribbles.

Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon• 2021

Related benchmarks

TaskDatasetResultRank
Gaussian DeblurringFFHQ 256x256 (val)
FID109
32
Super-Resolution (4x)ImageNet
PSNR27.4
30
Image InpaintingFFHQ 256x256 (val)
FID76.54
30
Super-ResolutionFFHQ 256x256 (val)
LPIPS0.563
19
Unpaired Image-to-Image TranslationCat → Dog v1 (test)
FID74.37
14
Super-Resolution (4x)CelebA
PSNR31.59
7
Motion DeblurringFFHQ 256x256 (val)
FID292.2
7
Unpaired Image-to-Image TranslationWild → Dog v1 (test)
FID75.33
5
Unpaired Image-to-Image TranslationMale → Female v1 (test)
FID46.12
5
Stroke Guided Image SynthesisCustom dataset 800 image-prompt pairs
F(x, y)108.2
4
Showing 10 of 11 rows

Other info

Follow for update