Multimodal Reasoning on MMMU Pro (Accuracy)

85.6 Accuracy

CoT2-Meta

[Chart: accuracy over time, Jun 8, 2025 to May 12, 2026]
Updated 21d ago

Evaluation Results

Date      Accuracy
2026.03   85.6
2026.03   81.3
2026.03   77.8
2026.01   76.96
2026.05   75.1
2026.05   73.8
2026.03   73.1
2026.05   72.83
2026.01   72.37
2026.01   70.6
2026.05   70.1
2025.11   69.1
2026.03   68.4
2025.11   68.3
2026.05   67.69
2026.01   67.18
2026.01   65.84
2026.01   64.08
2026.05   63
2025.06   62.4
2026.05   60.4
2026.05   60.3
2026.01   56.13
2026.04   56.1
2026.04   55.3
2026.05   53.2
2026.04   53.1
2026.01   51.9
2025.06   51.9
2026.04   51.9
2025.11   51.7
2025.06   51.5
2026.05   51.5
2026.01   51.47
2026.05   51.3
2025.11   51
2026.04   50.2
2026.04   49.6
2025.11   49.6
2025.06   49.5
2025.12   46.7
2025.11   46.3
2025.12   46.2
2025.12   45.2
2025.11   42.7
2025.06   42.4
2026.03   42.08
2025.12   41.7
2025.11   41.7
2026.03   41.5
2026.03   41.38
2026.03   41.38
2025.11   41.3
2025.11   41.3
2026.01   41
2026.04   41
2026.01   40.7
2026.01   40.6
2025.12   40.3
2026.03   40.11
2025.11   40
2025.12   39.7
2026.03   39.65
2026.01   39.5
2025.11   39.5
2025.12   39.4
2026.03   39.01
2025.12   38.9
2025.06   38.8
2026.03   38.32
2025.06   38.3
2025.11   38.3
2026.03   38.2
2026.04   38.2
2026.01   38
2026.01   38
2025.06   37.8
2025.06   37.6
2026.04   37.6
2025.12   37.4
2025.12   37.3
2025.06   37.2
2025.12   37.1
2026.03   37.1
2025.06   37
2026.03   36.5
2025.12   36.4
2025.12   36.4
2025.12   36.3
2026.03   36.3
2026.04   36
2026.03   35.49
2026.04   35.4
2026.03   35.08
2026.04   35
2025.12   34.7
2026.01   34.3
2025.06   34.3
2026.04   34.2
2026.01   33.9
Showing 100 of 146 rows