Mathematical Reasoning on AIME 2024 (Pass@1 and Token Length)

79.8 Pass@1 Accuracy

DeepSeek-R1

[Chart: Pass@1 accuracy; axis ticks 69.608, 72.254, 74.9, 77.546; date label Mar 6, 2025]
Updated 1mo ago

Evaluation Results

Method    Links
Pass@1    Token Length    Date
79.8      9.6             2025.03
78.1      11.8            2025.03
72.6      9.6             2025.03
70        -