Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Image Editing on Image Editing Dataset (Action)
Loading...
5.826
Unchanged Regions
MLLM-as-a-Judge
4.29304
4.69102
5.089
5.48698
Feb 13, 2026
Unchanged Regions
Global Consistency
Identity Preservation
Scale Realism
Spatial Relationship
Texture and Detail
Image Quality
Color and Lighting
Seamlessness
Alignment
Completeness
Plausibility
Overall Average
Updated 4d ago
Evaluation Results
Method
Method
Links
Unchanged Regions
Global Consistency
Identity Preservation
Scale Realism
Spatial Relationship
Texture and Detail
Image Quality
Color and Lighting
Seamlessness
Alignment
Completeness
Plausibility
Overall Average
MLLM-as-a-Judge
Judge=Our Judge
2026.02
5.826
6.087
6.696
6.565
6.043
6
6.174
5.87
5.913
6.957
6.87
6.826
6.319
Human
Judge=Human
2026.02
4.352
4.769
4.871
5.984
5.948
5.643
5.947
5.549
5.598
5.666
5.719
5.692
5.478
Feedback
Search any
task
Search any
task