Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-discipline Reasoning on MMMU-Pro

52.2Accuracy

Llama 4 Scout

48.35249.35150.3551.349Feb 12, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
52.2
2026.02
51.9
2026.02
51.9
2026.02
51.7
2026.02
51.6
2026.02
51.5
2026.02
49.2
2026.02
48.5