
Mathematical Reasoning on AIME OpenR1-Math Harder 2024

Accuracy: 72.7

Qwen-4B

[Chart: accuracy over time; y-axis ticks at 61.052, 64.076, 67.1, 70.124; x-axis label Feb 11, 2026]
Updated 1mo ago

Evaluation Results

Date      Accuracy
2026.02   72.7
2026.02   72.1
2026.02   61.5