Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AIME decontaminated 24 (Accuracy)

27.9Accuracy

DAPO-Math-17k

3.77210.03616.322.564May 26, 2026
Updated 7d ago

Evaluation Results

MethodLinks
2026.05
27.9
2026.05
25.9
2026.05
23.4
2026.05
17.3
2026.05
17.1
2026.05
16.5
2026.05
10.4
2026.05
9.4
2026.05
8.4
2026.05
7
2026.05
6.9
2026.05
6.3
2026.05
5.8
2026.05
4.7