Multimodal Reasoning on MMMU-Pro (Pass@1)

70.64 Pass@1

GPT-5-Nano-High

[Chart: Pass@1 (y-axis) by date (x-axis), May 11, 2025 to Feb 18, 2026]
Updated 1mo ago

Evaluation Results

Date      Pass@1
2026.02   70.64
2026.02   70.29
2026.02   70.29
2026.02   69.19
2025.05   68.8
2026.02   67.69
2025.05   67.6
2026.02   66.82
2025.05   66.4
2026.02   65.78
2026.02   65.08
2026.02   63.87
2026.02   63.47
2026.02   60.69
2025.05   59.9
2025.05   54.5
2025.05   51.1
2025.05   50.1