Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

About

Instruction-based image editing aims to modify specific image elements with natural language instructions. However, current models in this domain often struggle to accurately execute complex user instructions, as they are trained on low-quality data with limited editing types. We present AnyEdit, a comprehensive multi-modal instruction editing dataset, comprising 2.5 million high-quality editing pairs spanning over 20 editing types and five domains. We ensure the diversity and quality of the AnyEdit collection through three aspects: initial data diversity, adaptive editing process, and automated selection of editing results. Using the dataset, we further train a novel AnyEdit Stable Diffusion with task-aware routing and learnable task embedding for unified image editing. Comprehensive experiments on three benchmark datasets show that AnyEdit consistently boosts the performance of diffusion-based editing models. This presents prospects for developing instruction-driven image editing models that support human creativity.

Qifan Yu, Wei Chow, Zhongqi Yue, Kaihang Pan, Yang Wu, Xiaoyang Wan, Juncheng Li, Siliang Tang, Hanwang Zhang, Yueting Zhuang• 2024

Related benchmarks

TaskDatasetResultRank
Image EditingImgEdit-Bench
Overall Score2.45
132
Image EditingGEdit-Bench English
G_O (Overall Quality)3.21
73
Image EditingKRIS-Bench
Factual Knowledge Score39.26
65
Instructive image editingEMU Edit (test)
CLIP Image Similarity0.872
46
Image EditingGEdit-Bench
Semantic Consistency3.18
46
Instruction-based Image EditingImgEdit Bench 1.0 (test)
Add Score3.18
37
Image EditingAnyEdit (test)
CLIP Score (Input)0.867
28
Image EditingGEdit-Bench-EN v1.0 (Full set)
G Score (SC)3.053
22
Instructive image editingMagicBrush (test)
CLIP Image0.898
20
Instruction-guided image editingGEdit-Bench EN Full set
G_SC3.18
20
Showing 10 of 59 rows

Other info

Code

Follow for update