Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on MMBench CN

82Accuracy

Instruct

49.44857.89966.3574.801Aug 8, 2024Nov 22, 2024Mar 9, 2025Jun 23, 2025Oct 8, 2025Jan 22, 2026May 9, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.03
82
2026.03
82
2026.03
82
2026.03
82
2026.02
81.8
2026.03
81.6
2024.08
81.5
2026.03
81.5
2024.08
81.4
2026.03
81.2
2026.03
81
2026.03
81
2026.02
80.5
2026.02
79
2026.03
79
2026.02
78.1
2026.03
78
2026.03
78
2026.02
76.5
2026.05
75.93
2026.05
75.58
2026.05
75.51
2026.02
75.5
2026.05
75.49
2026.05
75.44
2026.05
75.33
2026.03
75
2026.05
74.79
2026.03
74.5
2026.05
74.33
2026.05
74.19
2026.05
73.94
2026.05
73.87
2026.05
73.2
2026.05
72.41
2026.02
71.5
2026.05
71.19
2026.05
70.87
2026.05
70.71
2024.10
63.8
2024.10
63.6
2025.08
60.6
2024.10
60.1
2024.08
59.8
2024.08
59.6
2025.08
59.3
2025.08
59.1
2024.08
58.9
2025.08
58.6
2024.10
58.3
2024.08
58.3
2026.04
58.1
2025.08
58.1
2026.04
57.5
2026.04
57.5
2026.04
57.4
2025.08
57.4
2026.04
57.2
2026.04
57.1
2026.04
57
2025.08
57
2025.08
57
2026.04
56.9
2026.04
56.8
2024.10
56.7
2025.08
56.7
2026.04
56.6
2026.04
56.6
2025.08
56.4
2025.08
56.4
2026.04
56.1
2026.04
56.1
2025.08
56
2026.04
55.8
2025.08
55.8
2025.08
55.6
2025.08
55.4
2026.04
55.3
2026.04
55.2
2025.08
55.2
2025.08
55.2
2026.04
55.1
2026.04
54.7
2026.04
53.7
2026.04
53.7
2026.04
53.6
2026.04
53.6
2025.08
53.5
2026.04
53.2
2026.04
53.1
2026.04
52.9
2026.04
52.8
2025.08
52.3
2026.04
51.9
2026.04
51.7
2026.04
51.4
2025.08
51.3
2026.04
51.1
2025.08
51
2025.08
50.7
Showing 100 of 113 rows