Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME OpenR1-Math Harder 2024

72.7Accuracy

Qwen-4B

Updated 5mo ago

Evaluation Results

Method	Links
Qwen-4B 2026.02		72.7
RePO 2026.02		72.1
LUFFY 2026.02		61.5