FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
About
Though Rectified Flows (ReFlows) with distillation offers a promising way for fast sampling, its fast inversion transforms images back to structured noise for recovery and following editing remains unsolved. This paper introduces FireFlow, a simple yet effective zero-shot approach that inherits the startling capacity of ReFlow-based models (such as FLUX) in generation while extending its capabilities to accurate inversion and editing in $8$ steps. We first demonstrate that a carefully designed numerical solver is pivotal for ReFlow inversion, enabling accurate inversion and reconstruction with the precision of a second-order solver while maintaining the practical efficiency of a first-order Euler method. This solver achieves a $3\times$ runtime speedup compared to state-of-the-art ReFlow inversion and editing techniques, while delivering smaller reconstruction errors and superior editing results in a training-free mode. The code is available at $\href{https://github.com/HolmesShuan/FireFlow}{this URL}$.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Editing | PIE-Bench | PSNR23.28 | 166 | |
| Text-to-Image Generation | MS-COCO 2014 (val) | FID25.16 | 137 | |
| Image Editing | PIE-Bench (test) | PSNR23.33 | 55 | |
| Image Editing | PIE-Bench | PSNR23.03 | 25 | |
| Image Semantic Editing | PIE-Bench (test) | PSNR23.28 | 18 | |
| Image Editing | PIE | Distance28.3 | 18 | |
| Image Reconstruction | PIE-Bench | MSE1.58 | 15 | |
| Image Editing | EditEval v2 | LPIPS0.385 | 14 | |
| Image Editing | 1024 x 1024 resolution | Runtime (4090, s)29.85 | 14 | |
| Layout-free HOI editing | IEBench | Editability-Identity0.451 | 14 |