
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing

About

Though Rectified Flows (ReFlows) with distillation offer a promising way for fast sampling, their fast inversion, which transforms images back to structured noise for recovery and subsequent editing, remains unsolved. This paper introduces FireFlow, a simple yet effective zero-shot approach that inherits the startling capacity of ReFlow-based models (such as FLUX) in generation while extending their capabilities to accurate inversion and editing in $8$ steps. We first demonstrate that a carefully designed numerical solver is pivotal for ReFlow inversion, enabling accurate inversion and reconstruction with the precision of a second-order solver while maintaining the practical efficiency of a first-order Euler method. This solver achieves a $3\times$ runtime speedup compared to state-of-the-art ReFlow inversion and editing techniques, while delivering smaller reconstruction errors and superior editing results in a training-free mode. The code is available at https://github.com/HolmesShuan/FireFlow.
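
As a rough illustration of why solver order matters for inversion, the toy sketch below compares the round-trip error (invert, then reconstruct) of a first-order Euler step and a generic second-order midpoint step over 8 steps. It is not the FireFlow solver: the linear `velocity` field, the time convention (t = 0 for the image, t = 1 for the noise), and all function names are assumptions made purely for illustration. Note also that the plain midpoint step shown here costs two velocity evaluations per step, whereas the abstract describes a solver that keeps roughly the cost of Euler.

```python
def velocity(x, t):
    # Hypothetical toy velocity field standing in for a learned ReFlow model.
    return -x


def euler_step(x, t, dt):
    # First-order update: one velocity evaluation per step.
    return x + dt * velocity(x, t)


def midpoint_step(x, t, dt):
    # Generic second-order update: two velocity evaluations per step.
    x_mid = x + 0.5 * dt * velocity(x, t)
    return x + dt * velocity(x_mid, t + 0.5 * dt)


def integrate(x, t_start, t_end, steps, step_fn):
    # Integrate dx/dt = velocity(x, t) from t_start to t_end with a fixed step size.
    dt = (t_end - t_start) / steps
    t = t_start
    for _ in range(steps):
        x = step_fn(x, t, dt)
        t += dt
    return x


def reconstruction_error(x0, steps, step_fn):
    # Inversion (image -> noise), then reconstruction (noise -> image).
    noise = integrate(x0, 0.0, 1.0, steps, step_fn)
    recon = integrate(noise, 1.0, 0.0, steps, step_fn)
    return abs(recon - x0)


if __name__ == "__main__":
    for name, step_fn in [("euler", euler_step), ("midpoint", midpoint_step)]:
        print(name, reconstruction_error(1.0, 8, step_fn))
```

On this toy problem the midpoint round trip returns much closer to the starting point than the Euler round trip at the same step count, which is the kind of gap a higher-order inversion scheme aims to exploit.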

Yingying Deng, Xiangyu He, Changwang Mei, Peisong Wang, Fan Tang • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Editing | PIE-Bench | PSNR 23.28 | 166 |
| Text-to-Image Generation | MS-COCO 2014 (val) | FID 25.16 | 137 |
| Image Editing | PIE-Bench (test) | PSNR 23.33 | 55 |
| Image Editing | PIE-Bench | PSNR 23.03 | 25 |
| Image Semantic Editing | PIE-Bench (test) | PSNR 23.28 | 18 |
| Image Editing | PIE | Distance 28.3 | 18 |
| Image Reconstruction | PIE-Bench | MSE 1.58 | 15 |
| Image Editing | EditEval v2 | LPIPS 0.385 | 14 |
| Image Editing | 1024 x 1024 resolution | Runtime (4090, s) 29.85 | 14 |
| Layout-free HOI editing | IEBench | Editability-Identity 0.451 | 14 |
Showing 10 of 14 rows
