Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on GaoKao En 2023

79.7Pass@1 Accuracy

Pass@8 (Upper Bound)

17.61233.73149.8565.969Feb 18, 2025Mar 31, 2025May 11, 2025Jun 21, 2025Aug 1, 2025Sep 11, 2025Oct 22, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.05
79.7
2025.02
78.4
2025.10
72.3
2025.05
72.2
2025.05
71.9
2025.05
71.7
2025.10
71.4
2025.05
71
2025.05
70.9
2025.05
70.9
2025.05
70.6
2025.05
70.6
2025.05
70.5
2025.05
70.4
2025.02
70.1
2025.05
69.9
2025.02
67.5
2025.05
66.1
2025.05
65.8
2025.05
64.2
2025.10
64.2
2025.10
63.8
2025.05
63.6
2025.10
63.4
2025.10
62.9
2025.10
62.7
2025.10
62.6
2025.10
62.1
2025.10
60.8
2025.10
57.9
2025.10
57.4
2025.02
50.9
2025.10
46
2025.10
45.7
2025.02
45.2
2025.10
42.6
2025.10
35.1
2025.10
33.5
2025.10
20