Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AIME 24 (accuracy, delta)

14Accuracy

Berr. Latent

-0.563.22710.78Feb 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
14-
2026.02
3.33.3
2026.02
1.5-
2026.02
1.5-
2026.02
0-