Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AIME '24 (Pass@1, Pass@32)

39.9Pass@1 Accuracy

ACTMat

21.38826.1943135.806Apr 1, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2026.04
39.980
2026.04
39.880
2026.04
38.276.7
2026.04
36.873.3
2026.04
35.980
2026.04
33.476.7
2026.04
30.873.3
2026.04
22.166.7