Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

About

This paper presents UltraEdit, a large-scale (approximately 4 million editing samples), automatically generated dataset for instruction-based image editing. Our key idea is to address the drawbacks in existing image editing datasets like InstructPix2Pix and MagicBrush, and provide a systematic approach to producing massive and high-quality image editing samples. UltraEdit offers several distinct advantages: 1) It features a broader range of editing instructions by leveraging the creativity of large language models (LLMs) alongside in-context editing examples from human raters; 2) Its data sources are based on real images, including photographs and artworks, which provide greater diversity and reduced bias compared to datasets solely generated by text-to-image models; 3) It also supports region-based editing, enhanced by high-quality, automatically produced region annotations. Our experiments show that canonical diffusion-based editing baselines trained on UltraEdit set new records on MagicBrush and Emu-Edit benchmarks. Our analysis further confirms the crucial role of real image anchors and region-based editing data. The dataset, code, and models can be found in https://ultra-editing.github.io.

Haozhe Zhao, Xiaojian Ma, Liang Chen, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li, Baobao Chang• 2024

Related benchmarks

TaskDatasetResultRank
Image EditingImgEdit-Bench
Overall Score2.7
191
Instructive image editingEMU Edit (test)
CLIP Image Similarity0.845
55
Single-image editingGEdit EN (full)
BG Change7.44
42
Instructive image editingMagicBrush (test)
CLIP Image0.868
37
Instruction-based Image EditingImgEdit Bench 1.0 (test)
Add Score3.44
37
Image EditingAnyEdit (test)
CLIP Score (Input)0.856
28
Instruction-based Image EditingKRIS Bench 38 (test)
Factual Score66.26
27
Instruction-based Image EditingRISEBench 49 (test)
Reasoning30.21
27
Image EditingImgEdit 1.0 (test)
Add Score3.44
27
Multi-turn image editingMSE-Bench
Success Rate (Turn 1)67.3
26
Showing 10 of 65 rows

Other info

Code

Follow for update