Multimodal Knowledge and Math on MMMU (val)

Accuracy: 69.1

GPT-4o

[Chart: accuracy over time, Oct 8, 2024 to May 13, 2026]
Updated 20d ago

Evaluation Results

Date      Accuracy
2024.10   69.1
-         62.2
2026.01   60.6
-         59.4
2026.01   57.1
2024.10   56.4
-         56.1
2024.10   54.9
2026.01   54.8
2026.01   54.4
-         54.2
-         52.5
2026.01   51.5
-         50.7
2026.05   48
2026.05   47.4
2026.05   46.6
2026.05   46.4
2026.05   46.4
2026.05   46.3
2026.05   46
2026.05   45.9
2026.05   45.8
-         45.7
-         45.3
2026.05   45.1
-         44.2
2026.05   39.6
2026.05   36.3
2026.05   36.1
2026.05   36.1
2026.05   31.6
2026.05   26.8