Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Multimodal Understanding on MMMU Pro
Loading...
56.9
Accuracy
Qwen3-VL-32B-Instruct
14.572
25.561
36.55
47.539
May 18, 2026
Accuracy
Updated 8d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3-VL-32B-Instruct
Backbone model=Qwen3-V...
2026.05
56.9
IVR-R1
Backbone model=Qwen3-V...
2026.05
53.3
Vision-R1
Backbone model=Qwen3-V...
2026.05
52.5
Vision-SR1
Backbone model=Qwen3-V...
2026.05
49.3
Supervised Fine-tuning (before RL)
Backbone model=Qwen3-V...
2026.05
48.6
IVR-R1
Backbone model=Qwen2.5...
2026.05
47.8
Vision-R1
Backbone model=Qwen2.5...
2026.05
47.2
Vision-SR1
Backbone model=Qwen2.5...
2026.05
45.5
Supervised Fine-tuning (before RL)
Backbone model=Qwen2.5...
2026.05
43.6
Zero-shot Inference (before RL)
Backbone model=Qwen3-V...
2026.05
42.7
Qwen2.5-VL-72B-Instruct
Backbone model=Qwen2.5...
2026.05
36.6
Qwen2.5-VL-32B-Instruct
Backbone model=Qwen2.5...
2026.05
21.7
Zero-shot Inference (before RL)
Backbone model=Qwen2.5...
2026.05
16.2
Feedback
Search any
task
Search any
task