Mathematical Reasoning on DynaMath

81.42 Accuracy

Gemini2.5-Pro

[Chart: accuracy over time, May 20, 2025 to Apr 2, 2026]
Updated 16d ago

Evaluation Results

Date      Accuracy
2026.03   81.42
2026.03   79.5
2026.03   77.41
2026.03   77.39
2026.03   75.78
2026.03   75.54
2026.03   75.05
2026.03   73.69
2026.03   73.25
2026.03   73.2
2026.03   72.4
2026.03   72.19
2026.03   71.67
2026.03   71.38
2026.03   71.22
2026.03   71
2026.03   69.4
2026.03   68.28
2026.03   68.12
2026.03   66.17
2026.03   65.44
2026.03   63.61
2026.03   62.37
2026.04   57.71
2026.04   57.11
2026.04   56.19
2026.04   55.96
2026.04   55.01
2025.05   55
2026.04   54.97
2026.04   54.89
2026.04   54.84
2026.04   54.8
2025.05   53.3
2025.10   53.3
2025.10   51.9
2025.10   51.3
2025.10   51.2
2026.04   48.45
2026.04   48.15
2026.04   47.82
2026.04   47.75
2026.04   47.3
2026.04   45.86
2025.10   42.5
2025.10   41.9
2025.10   40.7
2025.10   39.3
2025.10   37
2025.10   36.7
2026.04   32.88
2025.10   30.3
2026.03   29.74
2025.10   29.3
2025.10   28.5
2026.03   27.94
2026.03   27.14
2026.03   26.14
2026.03   25.34
2026.03   25.14
2026.03   24.75
2026.03   24.55
2026.03   24.55
2026.03   23.75
2026.03   23.55
2026.03   23.15
2026.03   22.36
2026.03   20.35
2026.03   20.35
2026.03   19.36
2026.03   18.16
2026.03   17.96
2026.03   17.96
2026.03   16.36
2026.03   12.37