Multi-discipline Multimodal Understanding on MMMU

84.2 Accuracy

GPT-5

[Chart: MMMU accuracy over time, Jan 29, 2024 to Feb 11, 2026]
Updated 3d ago

Evaluation Results

Method Links
2025.12
84.2----
2025.12
79----
76.9----
76----
2025.12
75.4----
74.7----
74.4----
2025.12
74----
2025.12
74----
2025.12
74----
2025.12
73.4----
2025.12
72.9----
2025.12
72.9----
72.7----
2025.12
72.7----
72.2----
2025.12
71.4----
2025.12
71.4----
2025.12
71.3----
2025.12
71.3----
71----
2025.12
70.3----
69.9----
2025.12
69.6----
2024.12
69.3----
2024.09
69.2----
2025.12
69.1----
68.3----
68.2----
2025.12
68----
2025.12
67.4----
2025.12
66.7----
2025.12
66.6----
64.6----
63.2----
2024.06
62.8----
2026.01
62.8 - - 64.3 97.7
2025.12
62.7----
62.2----
2025.12
61.2----
2026.01
61 - - 182.2 33.5
2026.01
60.1 - - 105.8 56.8
2024.09
59.7----
2025.12
59.7----
2025.12
59.6----
2026.01
59.5 - - 136.8 43.5
2026.01
59.3 - - 127.1 46.7
2026.01
59.2 - - 104.9 56.4
2025.12
59----
2026.01
58.9 - - 86.3 68.3
2026.01
58.6 - - 78.2 74.9
58.5----
2025.12
57.4----
2026.02
56.98----
2024.01
56.8----
2024.03
56.8----
2024.07
56.8----
2024.09
56.8----
2024.09
56.8----
56.8----
2026.02
56.42----
2024.09
56.1----
2025.12
56----
2025.12
55.8----
55.4----
2026.02
55.31----
2025.12
55----
2024.12
54.1----
2025.12
54.1----
2024.06
53.8----
2026.02
53.52----
2024.07
53.4----
2025.12
53.4----
2024.06
53.3----
2026.02
53.3----
2026.02
53.07----
52.7----
2026.02
52.51----
2024.09
52.1----
2024.09
51.9----
2024.03
51.4----
2024.12
51.2----
2025.12
51.2----
2025.04
51----
2025.04
51----
2025.12
50.9----
2024.09
50.3----
2024.12
49.9----
2024.12
49.8----
2024.09
49.8----
2025.12
49.8----
2024.09
49.7----
2024.09
49.7----
2024.12
49.3----
2026.02
49.16----
2025.12
49----
2024.03
48.9----
2024.12
48.8----
2024.06
48.8----
2025.12
48.8----
Showing 100 of 401 rows