Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-modal Understanding on MMMU (test)
Loading...
62.9
Final Performance
Full RT
43.452
48.501
53.55
58.599
Apr 6, 2026
Final Performance
MV 2B(S)/8B(S) Score
MV 2B(R)/8B(S) Score
Runtime (RT)
Updated 11d ago
Evaluation Results
Method
Method
Links
Final Performance
MV 2B(S)/8B(S) Score
MV 2B(R)/8B(S) Score
Runtime (RT)
Full RT
Backbone=Qwen3-VL-8B+2B-R
2026.04
62.9
-
-
-
MAI
Backbone=Qwen3-VL-2B/8B
2026.04
62
498
142
260
8B(S) Baseline
Backbone=Qwen3-VL-8B,...
2026.04
55.6
-
-
-
2B(R) Baseline
Backbone=Qwen3-VL-2B,...
2026.04
47.9
-
-
-
2B(S) Baseline
Backbone=Qwen3-VL-2B,...
2026.04
44.2
-
-
-
Feedback
Search any
task
Search any
task