Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-modal Reasoning on MMMU-Pro

85.6Accuracy

CoT2-Meta

18.36435.819553.27570.7305Mar 30, 2026Apr 2, 2026Apr 6, 2026Apr 10, 2026Apr 14, 2026Apr 18, 2026Apr 22, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
85.614.5
2026.03
78.410.5
2026.03
77.88.3
2026.03
73.14.8
2026.03
71.55.7
2026.03
68.4-
2026.03
68.23
2026.03
64.6-
2026.04
57.24-
2026.04
57.07-
2026.04
56.85-
2026.04
55.78-
2026.04
55.49-
2026.04
55.38-
2026.03
55.212.2
2026.04
54.23-
51.75-
2026.03
48.66.4
2026.03
44.53.4
2026.03
41-
2026.04
39.01-
2026.04
38.75-
2026.04
38.38-
2026.04
37.13-
2026.04
36.99-
2026.04
36.84-
2026.04
35.1-
2026.04
34.5-
2026.04
30.3-
2026.04
29.33-
2026.04
28.83-
2026.04
28.76-
2026.04
28.02-
2026.04
27.2-
2026.04
26.67-
2026.04
20.95-