Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 2024 (Pass@1 and Token Length)

79.8Pass@1 Accuracy

DeepSeek-R1

Updated 2mo ago

Evaluation Results

Method	Links
DeepSeek-R1 2025.03		79.8	9.6
TinyR1-32B-Preview 2025.03		78.1	11.8
DeepSeek-R1-Distill-Qwen-32B 2025.03		72.6	9.6
DeepSeek-R1-Distill-Llama-70B 2025.03		70	-