ImgEdit: A Unified Image Editing Dataset and Benchmark

About

Recent advancements in generative models have enabled high-fidelity text-to-image generation. However, open-source image-editing models still lag behind their proprietary counterparts, primarily due to limited high-quality data and insufficient benchmarks. To overcome these limitations, we introduce ImgEdit, a large-scale, high-quality image-editing dataset comprising 1.2 million carefully curated edit pairs, which contain both novel and complex single-turn edits and challenging multi-turn tasks. To ensure data quality, we employ a multi-stage pipeline that integrates a cutting-edge vision-language model, a detection model, and a segmentation model, alongside task-specific in-painting procedures and strict post-processing. ImgEdit surpasses existing datasets in both task novelty and data quality. Using ImgEdit, we train ImgEdit-E1, an editing model that uses a vision-language model to process the reference image and editing prompt. ImgEdit-E1 outperforms existing open-source models on multiple tasks, highlighting the value of both ImgEdit and the model design. For comprehensive evaluation, we introduce ImgEdit-Bench, a benchmark designed to evaluate image-editing performance in terms of instruction adherence, editing quality, and detail preservation. It includes a basic test suite, a challenging single-turn suite, and a dedicated multi-turn suite. We evaluate both open-source and proprietary models, as well as ImgEdit-E1, providing deep analysis and actionable insights into the current behavior of image-editing models. The source data are publicly available at https://github.com/PKU-YuanGroup/ImgEdit.
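
As a rough illustration of how results along the three ImgEdit-Bench dimensions (instruction adherence, editing quality, detail preservation) might be summarized per suite, here is a minimal Python sketch. The record type `EditScore`, the suite labels, the numeric scale, and the use of a plain mean are all assumptions made for this example; the abstract does not specify the actual scoring or aggregation procedure, and the values below are made up.

```python
from dataclasses import dataclass
from statistics import mean

# The three evaluation dimensions named in the abstract.
DIMENSIONS = ("instruction_adherence", "editing_quality", "detail_preservation")


@dataclass(frozen=True)
class EditScore:
    """One scored edit (hypothetical record layout, not the benchmark's format)."""
    suite: str  # hypothetical labels, e.g. "basic", "single_turn_hard", "multi_turn"
    instruction_adherence: float
    editing_quality: float
    detail_preservation: float


def summarize(scores):
    """Return {suite: {dimension: mean score}} for a list of EditScore records."""
    by_suite = {}
    for s in scores:
        by_suite.setdefault(s.suite, []).append(s)
    return {
        suite: {d: mean(getattr(s, d) for s in group) for d in DIMENSIONS}
        for suite, group in by_suite.items()
    }


# Made-up scores purely to exercise the function.
scores = [
    EditScore("basic", 4.0, 3.0, 5.0),
    EditScore("basic", 2.0, 5.0, 3.0),
    EditScore("multi_turn", 3.0, 3.0, 3.0),
]
summary = summarize(scores)
print(summary["basic"])
```

Keeping the dimensions separate per suite, rather than collapsing them into one number, makes it possible to see whether a model follows instructions well but damages unedited regions, or the reverse.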

Yang Ye, Xianyi He, Zongjian Li, Bin Lin, Shenghai Yuan, Zhiyuan Yan, Bohan Hou, Li Yuan • 2025

Related benchmarks

Task	Dataset	Result
Instructional Image Editing	CV-Arena 12K examples 1.0	Elo Rating: 1.00e+3	84
Single-image editing	GEdit EN (full)	BG Change: 7.3	42
Instruction-based Image Editing	KRIS Bench 38 (test)	Factual Score: 69.61	27
Instruction-based Image Editing	RISEBench 49 (test)	Reasoning: 35.42	27
Image Editing Quality Evaluation	Various Image Editing Datasets	Instruction Adherence Score: 3.26	12
Fidelity Assessment	ImgEdit-Bench	Accuracy: 30	2
Fidelity Assessment	GEdit-Bench	Accuracy: 52	2
