
Multi-discipline Reasoning on MMMU-Pro

52.2 Accuracy

Llama 4 Scout

[Chart: accuracy over time, Oct 14, 2025 to Feb 12, 2026]
Updated 1mo ago

Evaluation Results

Date      Accuracy
2026.02   52.2
2026.02   51.9
2026.02   51.9
2026.02   51.7
2026.02   51.6
2026.02   51.5
2026.02   49.2
2026.02   48.5
2025.10   37.1
2025.10   35.8
2025.10   33.8
2025.10   32
2025.10   31.1
2025.10   29.8
2025.10   28.4
2025.10   20.5
2025.10   19.5
2025.10   18.8
2025.10   16.2
2025.10   16.2
2025.10   15.8