
Multimodal Reasoning on MMMU Pro (Accuracy)

85.6 Accuracy

CoT2-Meta

[Chart: accuracy over time, Jun 8, 2025 to Apr 8, 2026]
Updated 8d ago

Evaluation Results

Date      Accuracy
2026.03   85.6
2026.03   81.3
2026.03   77.8
2026.01   76.96
2026.03   73.1
2026.01   72.37
2026.01   70.6
2026.03   68.4
2026.01   67.18
2026.01   65.84
2026.01   64.08
2025.06   62.4
2026.01   56.13
2026.04   56.1
2026.04   55.3
2026.04   53.1
2026.01   51.9
2025.06   51.9
2026.04   51.9
2025.06   51.5
2026.01   51.47
2026.04   50.2
2026.04   49.6
2025.06   49.5
2025.12   46.7
2025.12   46.2
2025.12   45.2
2025.06   42.4
2026.03   42.08
2025.12   41.7
2026.03   41.5
2026.03   41.38
2026.03   41.38
2026.01   41
2026.01   40.7
2026.01   40.6
2025.12   40.3
2026.03   40.11
2025.12   39.7
2026.03   39.65
2026.01   39.5
2025.12   39.4
2026.03   39.01
2025.12   38.9
2025.06   38.8
2026.03   38.32
2025.06   38.3
2026.03   38.2
2026.01   38
2026.01   38
2025.06   37.8
2025.06   37.6
2025.12   37.4
2025.12   37.3
2025.06   37.2
2025.12   37.1
2026.03   37.1
2025.06   37
2026.03   36.5
2025.12   36.4
2025.12   36.4
2025.12   36.3
2026.03   36.3
2026.03   35.49
2026.03   35.08
2025.12   34.7
2026.01   34.3
2025.06   34.3
2026.01   33.9
2025.06   33.8
2026.03   33.12
2025.12   33.1
2025.12   32.8
2025.12   32.8
2026.03   31.85
2026.03   31.73
2026.03   31.6
2026.03   31.4
2026.03   31.27
2025.12   31.1
2026.01   30.5
2026.03   30.34
2026.01   30.3
2026.03   29.88
2025.06   29.6
2025.06   29.4
2026.03   29.01
2026.01   29
2025.12   28.1
2026.03   28
2026.01   27.8
2026.01   27.3
2026.03   26.76
2025.12   25.2
2026.01   24.1
2025.12   24
2025.12   23.9
2025.12   23.8
2025.12   23.6
2025.12   22.2
Showing 100 of 107 rows