Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Reasoning on MMMU

83.89Accuracy

Gemini-2.5 (Pro)

22.020438.082754.14570.2073Feb 2, 2023Aug 6, 2023Feb 7, 2024Aug 10, 2024Feb 11, 2025Aug 15, 2025Feb 17, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.01
83.89
2026.01
80.11
2026.01
79.11
2026.01
78.7
2026.01
78.11
2026.01
78.11
2026.01
75.2
2026.01
73.53
2026.01
71.69
2026.01
71.17
2026.01
71.14
2025.12
64.9
2025.12
62.7
59.4
59
2025.12
59
2025.12
57.2
56.8
2025.12
55
53.4
2025.12
53.4
2025.11
51.78
2025.11
51.67
48.6
2025.12
48.6
2026.02
48.3
2025.12
47.7
2025.12
46.1
2025.11
45.78
2025.12
45.6
2025.11
45.33
2026.02
44.7
2025.12
44.6
2025.12
43.4
2025.12
42.9
2025.11
41.6
2025.12
41.4
41.1
2025.12
41.1
2023.02
28.7
2023.02
28.7
2023.02
27.9
2023.02
26.8
2023.02
24.4