Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-discipline Multimodal Understanding on MMMU

84.2Accuracy

GPT-5

50.60859.32968.0576.771Jan 29, 2024Jun 18, 2024Nov 7, 2024Mar 29, 2025Aug 18, 2025Jan 7, 2026May 29, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2025.12
84.2-----
2025.12
79-----
76.9-----
76.982.8----
2025.06
76.583.4----
2025.06
76.383.4----
76-----
7682.5----
2025.12
75.4-----
74.7-----
74.4-----
2025.12
74-----
2025.12
74-----
2025.12
74-----
2025.12
73.4-----
73.480.4----
2025.12
72.9-----
2025.12
72.9-----
72.7-----
2025.12
72.7-----
2025.06
72.780.1----
2025.06
72.580----
72.2-----
2025.12
71.4-----
2025.12
71.4-----
2025.12
71.3-----
2025.12
71.3-----
71-----
2025.12
70.3-----
69.9-----
2025.12
69.6-----
69.677----
2024.12
69.3-----
2024.09
69.2-----
2025.09
69.2-----
2025.12
69.1-----
68.3-----
68.2-----
2025.12
68-----
2025.12
67.4-----
2025.12
66.7-----
2025.12
66.6-----
64.6-----
63.2-----
2024.06
62.8-----
2026.01
62.8--64.397.7-
2025.12
62.7-----
62.2-----
62.2-----
2025.12
61.2-----
2026.01
61--182.233.5-
2026.01
60.1--105.856.8-
2024.09
59.7-----
2025.12
59.7-----
2025.12
59.6-----
2026.01
59.5--136.843.5-
2026.01
59.3--127.146.7-
2026.01
59.2--104.956.4-
2025.12
59-----
2026.01
58.9--86.368.3-
2026.05
58.63-----
2026.01
58.6--78.274.9-
58.5-----
2025.12
57.4-----
2025.09
57.2-----
2026.02
56.98-----
2024.01
56.8-----
2024.03
56.8-----
2024.07
56.8-----
2024.09
56.8-----
2024.09
56.8-----
56.8-----
2025.09
56.6-----
2026.02
56.42-----
2024.09
56.1-----
2025.12
56-----
2025.12
55.8-----
55.4-----
2026.05
55.4-----
2026.02
55.31-----
2025.12
55-----
54.2-----
2024.12
54.1-----
2025.12
54.1-----
2026.05
54.1-----
2024.06
53.8-----
2026.05
53.66-----
2026.02
53.52-----
2024.07
53.4-----
2025.12
53.4-----
2024.06
53.3-----
2026.02
53.3-----
2026.02
53.07-----
52.7-----
2026.02
52.51-----
52.5-----
2026.05
52.49-----
2026.04
52.3-----
2024.09
52.1-----
2024.09
51.9-----
Showing 100 of 590 rows