Edit Inspectors Questions on UltraEdit
[Chart: Accuracy by date. Plotted point: GPT-4, 63, Jun 11, 2025. Selectable metrics: Accuracy, Contextual Consistency, Technical Precision, Artifacts Score, Caption Acc (Diff).]
Updated 4d ago
Evaluation Results
| Method | Date | Accuracy | Contextual Consistency | Technical Precision | Artifacts Score | Caption Acc (Diff) |
|---|---|---|---|---|---|---|
| GPT-4 | 2025.06 | 63 | 48.5 | 53.4 | 42.9 | 55.6 |
| Qwen2.5 VL | 2025.06 | 62.1 | 58.2 | 43.8 | 50 | 54.4 |
| GPT-4 Turbo | 2025.06 | 57.5 | 37.1 | 48.5 | 51 | 57.5 |
| InternVL3 | 2025.06 | 54.4 | 61.9 | 46.2 | 41 | 44.4 |
| LLaVA (Training Protocol=Supe...) | 2025.06 | 54.3 | - | - | 55.6 | 48.1 |
| LLaVA | 2025.06 | 52.6 | - | - | 52.7 | 50 |
| GPT-4o | 2025.06 | 51.8 | 41.6 | 46.4 | 56.3 | 55.6 |