Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on MathVista (Accuracy)

89.2Accuracy

Gemini 3-Pro

2.131224.735647.3469.9444May 30, 2024Sep 10, 2024Dec 23, 2024Apr 6, 2025Jul 19, 2025Oct 31, 2025Feb 12, 2026
Updated 2d ago

Evaluation Results

MethodLinks
89.2-
2026.02
86.8-
2026.02
85.8-
2026.02
84.8-
2026.02
84.4-
82.7-
2026.02
82.1-
2026.02
75-
2026.02
74.3-
2026.01
73.5-
2026.02
72.5-
2026.02
72.1-
2025.12
71.8-
2026.02
71.5-
2026.01
71-
2024.09
69.9-
2026.01
69.4-
2026.01
69.3-
2026.02
68.2-
2026.02
67.6-
2026.02
67.5-
2026.02
66.5-
2024.09
66.2-
2024.09
65.5-
2026.02
65.1-
2024.12
64.8-
2024.12
64.6-
2026.01
64.4-
2026.02
64.1-
2026.01
64.1-
2026.02
63.9-
2026.01
63.9-
2024.09
63.8-
2026.01
63.7-
2026.02
63.6-
2026.02
63.51-
2026.01
63.5-
2026.01
63.5-
2025.11
63.5-
2026.02
63.46-
2026.02
62.75-
2024.09
62.6-
2026.01
62.6-
2025.11
62.5-
2025.11
62.2-
2026.02
62.1-
2025.11
61.6-
2025.12
61.5-
2026.01
61.4-
61.3-
2026.01
61.3-
2026.02
61.1-
2026.02
60.9-
60.86-
2026.01
60.6-
2026.02
60.2-
2025.12
59.5-
2026.02
58.3-
2026.02
58.15-
2024.12
57.6-
2026.02
57.15-
57-
2026.02
56.4-
2026.02
55.6-
2026.02
55.2-
2026.01
55.1-
2026.01
54.7-
2026.01
54.6-
2026.02
54.41-
2026.01
53.8-
2026.01
53.8-
2026.02
52.98-
2026.02
52.8-
2026.01
52.3-
52.1-
2024.09
51.9-
2026.02
51.5-
2026.01
51.5-
2025.11
51.5-
2024.09
49.9-
2026.01
49.5-
2024.12
49-
2026.01
47.6-
2026.01
45.4-
2024.09
44.9-
2026.01
43.9-
43-
2024.05
37-
2026.01
36.7-
2026.02
35.9-
2024.05
34.6-
2024.05
27.2-
2024.05
25.3-
2024.05
25.1-
2024.05
22.2-
2026.01
6.160.68
2026.01
5.48-