Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-modal Reasoning on MMMU-Pro

85.6Accuracy

CoT2-Meta

18.36435.819553.27570.7305Mar 30, 2026Mar 31, 2026Apr 1, 2026Apr 2, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.03
85.614.5
2026.03
78.410.5
2026.03
77.88.3
2026.03
73.14.8
2026.03
71.55.7
2026.03
68.4-
2026.03
68.23
2026.03
64.6-
2026.03
55.212.2
2026.03
48.66.4
2026.03
44.53.4
2026.03
41-
2026.04
39.01-
2026.04
38.75-
2026.04
38.38-
2026.04
37.13-
2026.04
36.99-
2026.04
36.84-
2026.04
35.1-
2026.04
34.5-
2026.04
30.3-
2026.04
29.33-
2026.04
28.83-
2026.04
28.76-
2026.04
28.02-
2026.04
27.2-
2026.04
26.67-
2026.04
20.95-