
Mathematical Reasoning on MathVista (Accuracy)

Top result: Gemini 3-Pro, 89.2 accuracy

Chart: accuracy over time, Sep 26, 2024 to Apr 12, 2026 (y-axis ticks at 65.696, 71.798, 77.9, 84.002)
Updated 4d ago

Evaluation Results

Accuracy  Method / Links  Date
89.2      -               2026.02
86.8      -               2026.02
85.8      -               2026.02
84.8      -               2026.02
84.4      -
83.8      -
82.7      -               2026.02
82.1      -
81.9      -               2025.09
81.9      -               2025.06
81.5      -               2025.06
81.3      -               2025.09
80.9      -               2025.06
80.3      -               2025.06
80.1      -
78.4      -               2026.03
77.4      -
77.2      -               2026.03
77.2      -               2026.03
76.3      -               2026.03
75.6      -               2025.09
75.6      -               2026.04
75.1      -               2026.02
75        -               2025.06
74.9      -               2026.03
74.9      -               2026.03
74.9      -               2026.04
74.9      -               2026.04
74.9      -               2026.03
74.8      -               2026.04
74.8      -               2026.02
74.7      -               2026.02
74.3      -               2026.03
74.2      -               2026.02
74.1      -               2026.01
73.5      -               2026.03
73.5      -               2025.09
73.5      -               2026.04
73.5      -               2026.02
73.3      -               2026.02
73.27     -               2026.02
73        -               2026.02
72.8      -               2026.04
72.8      -               2026.02
72.5      -
72.3      -               2026.04
72.3      -               2026.02
72.1      -               2025.06
71.9      -               2026.03
71.9      -               2026.02
71.84     -               2025.12
71.8      -               2026.03
71.6      -               2026.03
71.6      -               2026.04
71.6      -               2026.02
71.5      -               2025.06
71.4      -               2026.02
71.38     -               2026.03
71.2      -               2026.01
71        -               2026.02
70.63     -
70.5      -               2025.06
70.5      -               2026.02
70.2      -               2025.06
70.2      -               2025.05
70.1      -               2026.04
70.1      -               2026.03
70        -               2024.09
69.9      -               2026.04
69.8      -               2026.01
69.4      -               2026.01
69.3      -               2025.06
69.1      -               2025.05
68.3      -               2026.02
68.2      -               2025.05
68.2      -               2025.06
68.2      -               2025.11
68.2      -               2026.03
68.2      -               2025.09
68.2      -               2026.04
68.2      -               2025.06
68        -               2025.09
68        -               2026.02
67.8      -               2026.04
67.8      -
67.7      -
67.7      -               2026.02
67.6      -               2025.06
67.6      -               2026.04
67.53     -               2026.02
67.5      -
67.5      -               2025.06
67.5      -               2026.03
67.3      -               2026.02
67.2      -               2026.03
67.1      -               2025.09
67.1      -               2025.06
66.8      -               2026.03
66.8      -               2025.06
66.6      -
Showing 100 of 257 rows