Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-discipline Multimodal Understanding on MMMU

84.2Accuracy

GPT-5

48.517657.781367.04576.3087Jan 29, 2024Jun 4, 2024Oct 9, 2024Feb 14, 2025Jun 21, 2025Oct 26, 2025Mar 3, 2026
Updated 6d ago

Evaluation Results

MethodLinks
2025.12
84.2-----
2025.12
79-----
76.9-----
76.982.8----
2025.06
76.583.4----
2025.06
76.383.4----
76-----
7682.5----
2025.12
75.4-----
74.7-----
74.4-----
2025.12
74-----
2025.12
74-----
2025.12
74-----
2025.12
73.4-----
73.480.4----
2025.12
72.9-----
2025.12
72.9-----
72.7-----
2025.12
72.7-----
2025.06
72.780.1----
2025.06
72.580----
72.2-----
2025.12
71.4-----
2025.12
71.4-----
2025.12
71.3-----
2025.12
71.3-----
71-----
2025.12
70.3-----
69.9-----
2025.12
69.6-----
69.677----
2024.12
69.3-----
2024.09
69.2-----
2025.12
69.1-----
68.3-----
68.2-----
2025.12
68-----
2025.12
67.4-----
2025.12
66.7-----
2025.12
66.6-----
64.6-----
63.2-----
2024.06
62.8-----
2026.01
62.8--64.397.7-
2025.12
62.7-----
62.2-----
2025.12
61.2-----
2026.01
61--182.233.5-
2026.01
60.1--105.856.8-
2024.09
59.7-----
2025.12
59.7-----
2025.12
59.6-----
2026.01
59.5--136.843.5-
2026.01
59.3--127.146.7-
2026.01
59.2--104.956.4-
2025.12
59-----
2026.01
58.9--86.368.3-
2026.01
58.6--78.274.9-
58.5-----
2025.12
57.4-----
2026.02
56.98-----
2024.01
56.8-----
2024.03
56.8-----
2024.07
56.8-----
2024.09
56.8-----
2024.09
56.8-----
56.8-----
2026.02
56.42-----
2024.09
56.1-----
2025.12
56-----
2025.12
55.8-----
55.4-----
2026.02
55.31-----
2025.12
55-----
2024.12
54.1-----
2025.12
54.1-----
2024.06
53.8-----
2026.02
53.52-----
2024.07
53.4-----
2025.12
53.4-----
2024.06
53.3-----
2026.02
53.3-----
2026.02
53.07-----
52.7-----
2026.02
52.51-----
2024.09
52.1-----
2024.09
51.9-----
2024.03
51.4-----
2026.03
51.33-----
2024.12
51.2-----
2025.12
51.2-----
2026.03
51.11-----
2025.04
51-----
2025.04
51-----
2025.12
50.9-----
2026.03
50.33-----
2024.09
50.3-----
2024.12
49.9-----
2026.03
49.89-----
Showing 100 of 499 rows