Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on MathVista (Accuracy)

89.2Accuracy

Gemini 3-Pro

69.887274.901179.91584.9289Jun 18, 2025Aug 13, 2025Oct 8, 2025Dec 3, 2025Jan 28, 2026Mar 25, 2026May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
89.2--
2026.05
88.5--
2026.02
86.8--
2026.05
85.9--
2026.05
85.9--
2026.02
85.8--
2026.02
84.8--
2026.05
84.6--
2026.02
84.4--
83.8--
82.7--
2026.02
82.1--
81.9--
2025.09
81.9--
2026.05
81.9--
2025.06
81.5--
2025.06
81.3--
2025.09
80.9--
2025.06
80.3--
2025.06
80.1--
2026.05
80.1--
78.4--
2026.05
77.6--
2026.05
77.5--
2026.03
77.4--
2026.05
77.4--
77.2--
2026.03
77.2--
2026.05
77.2--
2026.05
77.2--
2026.03
76.3--
2026.05
76.1--
2026.05
76--
2026.05
75.8--
2026.03
75.6--
2025.09
75.6--
2026.05
75.4--
2026.05
75.3--
2026.04
75.1--
2026.05
75.1--
2026.02
75--
2025.06
74.9--
2026.03
74.9--
2026.03
74.9--
2026.04
74.9--
2026.04
74.9--
2026.05
74.9--
2026.03
74.8--
2026.04
74.8--
2026.02
74.7--
2026.02
74.3--
2026.03
74.2--
2026.05
74.2--
2026.02
74.1--
2026.05
74.1--
2025.11
74--
2026.05
73.9--
2026.01
73.5--
2026.03
73.5--
2025.09
73.5--
2026.04
73.5--
2025.11
73.5--
2026.05
73.5--
2026.05
73.5--
2026.05
73.4--
2026.02
73.3--
2026.05
73.3--
2026.02
73.27--
2026.05
73.2--
2026.05
73.2--
2026.02
73--
2025.11
73--
2026.02
72.8--
2026.04
72.8--
2026.05
72.8--
2026.02
72.5--
2025.11
72.4--
72.3--
2026.04
72.3--
2026.02
72.1--
2025.11
72.1--
2025.11
72--
2025.06
71.9--
2026.03
71.9--
2025.11
71.9--
2026.02
71.84--
2025.12
71.8--
2026.05
71.7--
2026.03
71.6--
2026.03
71.6--
2026.04
71.6--
2026.05
71.6--
2026.02
71.5--
2025.06
71.4--
2026.02
71.38--
2026.03
71.2--
2026.01
71--
2025.11
71--
2026.05
70.9--
2026.02
70.63--
Showing 100 of 382 rows