
ReNoise: Real Image Inversion Through Iterative Noising

About

Recent advancements in text-guided diffusion models have unlocked powerful image manipulation capabilities. However, applying these methods to real images necessitates the inversion of the images into the domain of the pretrained diffusion model. Achieving faithful inversion remains a challenge, particularly for more recent models trained to generate images with a small number of denoising steps. In this work, we introduce an inversion method with a high quality-to-operation ratio, enhancing reconstruction accuracy without increasing the number of operations. Building on reversing the diffusion sampling process, our method employs an iterative renoising mechanism at each inversion sampling step. This mechanism refines the approximation of a predicted point along the forward diffusion trajectory, by iteratively applying the pretrained diffusion model, and averaging these predictions. We evaluate the performance of our ReNoise technique using various sampling algorithms and models, including recent accelerated diffusion models. Through comprehensive evaluations and comparisons, we show its effectiveness in terms of both accuracy and speed. Furthermore, we confirm that our method preserves editability by demonstrating text-driven image editing on real images.
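As a rough illustration of the renoising idea described above, the following minimal sketch assumes a deterministic DDIM-style sampler and a generic noise predictor. The names `ddim_inversion_step`, `renoise_step`, `eps_model`, `alpha_prev`, `alpha_next`, and `num_renoise` are hypothetical and do not come from the authors' code. Uniformly averaging all noise predictions is a simplifying assumption; the paper may weight or select the predictions differently.

```python
import numpy as np


def ddim_inversion_step(z_prev, eps, alpha_prev, alpha_next):
    """Move a latent from noise level alpha_prev to the noisier level alpha_next.

    alpha_* are cumulative signal coefficients (alpha-bar in DDIM notation),
    with 0 < alpha_next < alpha_prev <= 1. eps is the noise estimate to use.
    """
    # Clean-image estimate implied by z_prev and the noise estimate.
    x0 = (z_prev - np.sqrt(1.0 - alpha_prev) * eps) / np.sqrt(alpha_prev)
    # Re-noise that estimate to the next (noisier) level.
    return np.sqrt(alpha_next) * x0 + np.sqrt(1.0 - alpha_next) * eps


def renoise_step(z_prev, t_next, eps_model, alpha_prev, alpha_next, num_renoise=4):
    """One inversion step with iterative renoising (illustrative sketch only).

    eps_model(z, t) is an assumed callable returning the predicted noise.
    """
    z_est = z_prev  # first pass: plain DDIM inversion approximation
    eps_history = []
    for _ in range(num_renoise):
        # Re-evaluate the model at the current estimate of the noisier latent.
        eps_history.append(eps_model(z_est, t_next))
        z_est = ddim_inversion_step(z_prev, eps_history[-1], alpha_prev, alpha_next)
    # Assumption: uniform average over all collected noise predictions.
    eps_avg = np.mean(eps_history, axis=0)
    return ddim_inversion_step(z_prev, eps_avg, alpha_prev, alpha_next)


if __name__ == "__main__":
    # Toy stand-in for a pretrained diffusion model, used only to run the sketch.
    toy_model = lambda z, t: 0.1 * z
    z0 = np.ones((4, 4))
    z1 = renoise_step(z0, t_next=1, eps_model=toy_model, alpha_prev=0.9, alpha_next=0.8)
    print(z1.shape)
```

With `num_renoise=1` the function reduces to the usual DDIM inversion approximation, which evaluates the model at the previous latent instead of the unknown noisier one. The extra iterations re-evaluate the model at successively better estimates of that latent.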

Daniel Garibi, Or Patashnik, Andrey Voynov, Hadar Averbuch-Elor, Daniel Cohen-Or • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Editing | PIE-Bench | PSNR 20.85 | 116 |
| Image Reconstruction | COCO 2017 (val) | PSNR 31.025 | 54 |
| Image Editing | PIE-Bench (test) | PSNR 20.28 | 46 |
| Image Editing | PIE-Bench 1.0 (test) | PSNR 20.28 | 22 |
| Image-to-Image Translation (Appearance Divergence) | LAION Mini | Structure Similarity 95.3 | 20 |
| Image-to-Image Translation (Appearance Consistency) | LAION Mini | Structure Similarity 0.938 | 20 |
| Text-Guided Image Editing | General Image Editing | Speedup 5.08 | 12 |
| Diffusion Inversion | edges2shoes | PSNR 7.95 | 9 |
| Diffusion Inversion | BBBC021 x16 | PSNR 16.2 | 9 |
| Image Reconstruction | nocaps (val) | LPIPS 0.241 | 5 |
Showing 10 of 11 rows
