Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-discipline Reasoning on MMMU-Pro
Loading...
52.2
Accuracy
Llama 4 Scout
48.352
49.351
50.35
51.349
Feb 12, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Llama 4 Scout
Zero-shot=true
2026.02
52.2
Qwen2.5-VL-32B + AT-RL (Ours)
Zero-shot=true
2026.02
51.9
OpenAI GPT-4o
Zero-shot=true
2026.02
51.9
Gemini 2.0 Flash
Zero-shot=true
2026.02
51.7
Qwen2.5-VL-72B Instruct
Zero-shot=true
2026.02
51.6
Claude 3.5 Sonnet
Zero-shot=true
2026.02
51.5
Qwen2.5-VL-32B + VPPO
Zero-shot=true
2026.02
49.2
Qwen2.5-VL-32B Instruct
Zero-shot=true
2026.02
48.5
Feedback
Search any
task
Search any
task