Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models

About

In image editing employing diffusion models, it is crucial to preserve the reconstruction fidelity to the original image while changing its style. Although existing methods ensure reconstruction fidelity through optimization, a drawback of these is the significant amount of time required for optimization. In this paper, we propose negative-prompt inversion, a method capable of achieving equivalent reconstruction solely through forward propagation without optimization, thereby enabling ultrafast editing processes. We experimentally demonstrate that the reconstruction fidelity of our method is comparable to that of existing methods, allowing for inversion at a resolution of 512 pixels and with 50 sampling steps within approximately 5 seconds, which is more than 30 times faster than null-text inversion. Reduction of the computation time by the proposed method further allows us to use a larger number of sampling steps in diffusion models to improve the reconstruction fidelity with a moderate increase in computation time.

Daiki Miyake, Akihiro Iohara, Yu Saito, Toshiyuki Tanaka• 2023

Related benchmarks

TaskDatasetResultRank
Image EditingPIE-Bench
PSNR26.21
116
Image EditingPIE-Bench (test)--
46
Image EditingPIE-Bench 1.0 (test)
PSNR28.67
22
Image-to-Image Translation (Appearance Divergence)LAION Mini
Structure Similarity95.2
20
Image-to-Image Translation (Appearance Consistency)LAION Mini
Structure Similarity0.945
20
Image InversionPIE-Bench
Inference Time (s)18.22
6
Image ReconstructionPIE-Bench base model v1.4 (test)
Structure Distance8.47
5
Image EditingCOCO (val)
CLIP Score23.77
4
Showing 8 of 8 rows

Other info

Follow for update