
Mathematical Multimodal Reasoning on MathVista (Accuracy)

Accuracy: 85.6

Seed-1.5-thinking

[Chart: accuracy over time. Y-axis ticks: 71.352, 75.051, 78.75, 82.449. X-axis: Jun 5, 2025 to May 29, 2026.]
Updated 2d ago

Evaluation Results

Date      Accuracy
2025.12   85.6
2025.12   83.8
2025.11   83
2026.03   80.2
2026.03   79.9
2026.03   79.4
2026.03   79.3
2025.06   79
2026.03   78.8
2026.03   78.5
2026.03   78.4
2025.06   78.2
-         77.6
2026.02   77.1
2025.06   76.8
2025.06   76.8
2025.06   76.8
2025.06   76.6
2026.02   76.5
2025.12   76.1
2025.06   76.1
2025.06   76.1
2026.03   76.1
2026.03   76.1
2025.12   76
2026.05   75.93
2025.06   75.9
2026.03   75.9
2025.06   75.8
2025.11   75.7
2025.12   75.6
2026.03   75.5
2026.03   75.4
-         75.4
2025.12   75.1
2025.06   75.1
2025.06   75.1
2025.06   75.1
2025.12   74.9
2025.06   74.9
2025.12   74.8
-         74.8
2026.03   74.8
2026.03   74.8
2026.03   74.7
2026.03   74.7
2025.06   74.7
2026.03   74.7
-         74.3
-         74.2
2025.06   74.2
2025.06   74.2
2026.03   74.2
2026.04   74.2
2025.06   74.2
2026.03   73.9
2026.03   73.9
-         73.8
2026.03   73.8
2026.02   73.7
2025.11   73.7
2025.06   73.7
2026.04   73.7
2025.06   73.6
2026.03   73.6
2026.01   73.5
2026.03   73.5
2026.03   73.5
2026.03   73.5
2026.03   73.5
2026.03   73.5
2026.03   73.5
2025.12   73.4
2025.06   73.3
2025.12   73.1
2025.12   73
2025.12   73
2026.03   73
2026.03   73
2025.06   73
-         72.9
2026.03   72.7
2026.03   72.7
2025.08   72.7
2025.11   72.6
2026.03   72.6
2026.03   72.6
2025.11   72.5
2026.04   72.4
2025.12   72.3
2025.12   72.3
-         72.3
2026.03   72.3
2025.08   72.2
2026.05   72.08
2025.12   72
-         71.9
2026.03   71.9
2025.11   71.9
2026.03   71.9
Showing 100 of 258 rows