Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Edit Inspectors Question Answering on MagicBrush (test)
Loading...
73.7
Accuracy
GPT-4
57.892
61.996
66.1
70.204
Jun 11, 2025
Accuracy
Contextual Consistency
Technical Precision
Artifacts Score
Diff Caption Acc
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Contextual Consistency
Technical Precision
Artifacts Score
Diff Caption Acc
GPT-4
2025.06
73.7
48.2
49.2
51.6
63.2
GPT-4o
2025.06
71
56.7
47.6
60.6
60.7
Qwen2.5 VL
2025.06
64
38.4
47.5
50
59.8
InternVL3
2025.06
63.6
45.6
42.6
52.5
58.4
LLaVA
training=Supervised
2025.06
62.3
-
-
56.2
46.5
GPT-4 Turbo
2025.06
60
59.7
49.8
54.1
61.1
LLaVA
2025.06
58.5
-
-
44.8
50
Feedback
Search any
task
Search any
task