Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on LogicVista

73.8Accuracy

Gemini-2.5-Pro*

24.71237.45650.262.944Sep 30, 2025Oct 9, 2025Oct 18, 2025Oct 28, 2025Nov 6, 2025Nov 15, 2025Nov 25, 2025
Updated 26d ago

Evaluation Results

MethodLinks
2025.09
73.8--
2025.09
70--
2025.11
64.4--
2025.11
54.6--
2025.11
52.8--
2025.11
52.3--
2025.11
51--
2025.09
50.9--
2025.09
49.7--
2025.11
49.7--
2025.11
49.3--
2025.11
49.2--
2025.09
48.5--
2025.11
46.1--
2025.09
45.6--
2025.11
44.5--
2025.11
43.6--
2025.11
43.6--
2025.11
42.7--
2025.11
42.7--
2025.09
42.6--
2025.11
41.4--
2025.09
39.7--
2025.09
38.3--
2025.11
35.6--
2025.11
32--
2025.11
26.6--
2026.04
-7081.5
2026.04
-73.876.4
2026.04
-42.659.3
2026.04
-38.344.7
2026.04
-45.652.5
2026.04
-48.554.5
2026.04
-49.753.8
2026.04
-42.850.3
2026.04
-50.661.5