Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on ASDiv (test)

97.24Accuracy

SGE

2.350426.985251.6276.2548Feb 9, 2023Jun 12, 2023Oct 14, 2023Feb 14, 2024Jun 17, 2024Oct 18, 2024Feb 19, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2024.05
97.24
2024.05
94.08
2024.05
94.03
2024.05
93.5
2024.05
93.32
2024.05
93.1
2024.05
92.7
2024.05
90.2
2024.05
90.1
2024.02
81.9
2024.02
81.4
2024.02
80
2024.02
78.7
2024.02
78.6
2024.02
77.2
2024.02
76.8
2024.02
76.7
2024.02
73.9
2025.02
65.9
2024.02
65.8
2025.02
63.8
2025.02
62.3
2024.02
59.1
2024.02
58.6
2024.02
56.3
2024.02
50.7
2024.02
47.4
2025.02
46
2025.02
45.9
2025.02
45.2
2025.02
43.6
2025.02
41.7
2023.02
40.4
2023.02
14.8
2023.02
14
2023.02
9.6
2023.02
7.5
2023.02
6