UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
About
This paper presents UltraEdit, a large-scale (approximately 4 million editing samples), automatically generated dataset for instruction-based image editing. Our key idea is to address the drawbacks in existing image editing datasets like InstructPix2Pix and MagicBrush, and provide a systematic approach to producing massive and high-quality image editing samples. UltraEdit offers several distinct advantages: 1) It features a broader range of editing instructions by leveraging the creativity of large language models (LLMs) alongside in-context editing examples from human raters; 2) Its data sources are based on real images, including photographs and artworks, which provide greater diversity and reduced bias compared to datasets solely generated by text-to-image models; 3) It also supports region-based editing, enhanced by high-quality, automatically produced region annotations. Our experiments show that canonical diffusion-based editing baselines trained on UltraEdit set new records on MagicBrush and Emu-Edit benchmarks. Our analysis further confirms the crucial role of real image anchors and region-based editing data. The dataset, code, and models can be found in https://ultra-editing.github.io.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Editing | ImgEdit-Bench | Overall Score2.7 | 191 | |
| Instructive image editing | EMU Edit (test) | CLIP Image Similarity0.845 | 55 | |
| Single-image editing | GEdit EN (full) | BG Change7.44 | 42 | |
| Instructive image editing | MagicBrush (test) | CLIP Image0.868 | 37 | |
| Instruction-based Image Editing | ImgEdit Bench 1.0 (test) | Add Score3.44 | 37 | |
| Image Editing | AnyEdit (test) | CLIP Score (Input)0.856 | 28 | |
| Instruction-based Image Editing | KRIS Bench 38 (test) | Factual Score66.26 | 27 | |
| Instruction-based Image Editing | RISEBench 49 (test) | Reasoning30.21 | 27 | |
| Image Editing | ImgEdit 1.0 (test) | Add Score3.44 | 27 | |
| Multi-turn image editing | MSE-Bench | Success Rate (Turn 1)67.3 | 26 |