Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

About

Text-guided image editing is widely needed in daily life, ranging from personal use to professional applications such as Photoshop. However, existing methods are either zero-shot or trained on an automatically synthesized dataset, which contains a high volume of noise. Thus, they still require lots of manual tuning to produce desirable outcomes in practice. To address this issue, we introduce MagicBrush (https://osu-nlp-group.github.io/MagicBrush/), the first large-scale, manually annotated dataset for instruction-guided real image editing that covers diverse scenarios: single-turn, multi-turn, mask-provided, and mask-free editing. MagicBrush comprises over 10K manually annotated triplets (source image, instruction, target image), which supports trainining large-scale text-guided image editing models. We fine-tune InstructPix2Pix on MagicBrush and show that the new model can produce much better images according to human evaluation. We further conduct extensive experiments to evaluate current image editing baselines from multiple dimensions including quantitative, qualitative, and human evaluations. The results reveal the challenging nature of our dataset and the gap between current baselines and real-world editing needs.

Kai Zhang, Lingbo Mo, Wenhu Chen, Huan Sun, Yu Su• 2023

Related benchmarks

TaskDatasetResultRank
Image EditingImgEdit-Bench
Overall Score1.9
191
Image EditingPIE-Bench
PSNR26.85
166
Image EditingGEdit-Bench
Semantic Consistency4.68
92
Image EditingGEdit-Bench English
G_O (Overall Quality)4.52
84
Image EditingKRIS-Bench
Factual Knowledge Score41.84
74
Image EditingGEdit-Bench-EN (full)
G-Score (O)4.52
66
Instructive image editingEMU Edit (test)
CLIP Image Similarity0.867
55
Single-image editingGEdit EN (full)
BG Change6.17
42
Instructive image editingMagicBrush (test)
CLIP Image0.883
37
Instruction-based Image EditingImgEdit Bench 1.0 (test)
Add Score2.84
37
Showing 10 of 95 rows
...

Other info

Code

Follow for update