Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Difference Caption Generation on Imagen edits 3
Loading...
44
Main Difference Score
GPT-4o
2.4
13.2
24
34.8
Jun 11, 2025
Main Difference Score
MP Score
MP Soft Score
HR Score
HR Soft Score
Average Difference
No Diffs Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Main Difference Score
MP Score
MP Soft Score
HR Score
HR Soft Score
Average Difference
No Diffs Rate
GPT-4o
2025.06
44
11
12
73
70
1.9
0
GPT-4
2025.06
29
9
11
78
73
2.5
80
Qwen2.5 VL
2025.06
29
10
12
73
67
1.5
120
GPT-4 Turbo
2025.06
27
8
9
82
80
1.5
450
InternVL3
2025.06
13
9
12
92
88
3.2
400
LLaVA
training=supervised
2025.06
11
-
-
-
-
-
-
LLaVA
2025.06
4
-
-
-
-
-
-
Feedback
Search any
task
Search any
task