UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
About
Flow matching models have emerged as a strong alternative to diffusion models, but existing inversion and editing methods designed for diffusion are often ineffective or inapplicable to them. The straight-line, non-crossing trajectories of flow models pose challenges for diffusion-based approaches but also open avenues for novel solutions. In this paper, we introduce a predictor-corrector-based framework for inversion and editing in flow models. First, we propose Uni-Inv, an effective inversion method designed for accurate reconstruction. Building on this, we extend the concept of delayed injection to flow models and introduce Uni-Edit, a region-aware, robust image editing approach. Our methodology is tuning-free, model-agnostic, efficient, and effective, enabling diverse edits while ensuring strong preservation of edit-irrelevant regions. Extensive experiments across various generative models demonstrate the superiority and generalizability of Uni-Inv and Uni-Edit, even under low-cost settings. Project page: https://uniedit-flow.github.io/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Editing | PIE-Bench | PSNR29.54 | 215 | |
| Image Editing | PIE-Bench (test) | PSNR29.54 | 55 | |
| Image Reconstruction | PIE-Bench | MSE1.27 | 15 | |
| 3D Editing | 3D Editing | Time (s)34.86 | 11 | |
| Video Editing | FiVE-Bench (test) | Structural Distance18.31 | 11 | |
| Conditional Image Reconstruction | PIE-Bench | Rank1.25 | 10 | |
| Conditional Real Image Inversion & Reconstruction | Conceptual Captions (val) | MSE7.86 | 10 | |
| Unconditional Real Image Inversion & Reconstruction | Conceptual Captions (val) | MSE8.85 | 10 | |
| Video Editing | Anchor-Bench | CLIP Temporal Score24.69 | 8 | |
| Image Editing | PIE-Bench FLUX.1 (dev) | Rank4.62 | 8 |