Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Vision-Language Understanding on MMBench (test)
Loading...
78.8
Overall Accuracy
Baseline
15.88
32.215
48.55
64.885
Oct 13, 2025
Overall Accuracy
AR Score
CP Score
FP-C Score
FP-S Score
LR Score
RR Score
Updated 1d ago
Evaluation Results
Method
Method
Links
Overall Accuracy
AR Score
CP Score
FP-C Score
FP-S Score
LR Score
RR Score
Baseline
Model=Qwen2.5-VL-3B, A...
2025.10
78.8
81.3
83.1
69.2
84.9
66.5
75.4
Baseline
Model=InternVL3-1B, Au...
2025.10
72.7
77.4
82.4
58.7
79.4
50.3
66.8
NTE
Model=InternVL3-1B, Au...
2025.10
72.6
77.8
82
58.3
78.9
50.9
67.3
NTE
Model=Qwen2.5-VL-3B, A...
2025.10
18.3
29.2
15.6
17
12.8
17.3
22.3
Feedback
Search any
task
Search any
task