Mathematical Reasoning on AIME24 (Pass@1 Avg@32)

Pass@1 Accuracy: 32.2

Mix-RL

Updated 5mo ago

Evaluation Results

Method	Links	Pass@1 Accuracy
Mix-RL 2026.02		32.2
Unfold-RL 2026.02		32
Fold-RL 2026.02		31.3
Unfold-RL 2026.02		29.1
Fold-RL 2026.02		28.4
Mix-RL 2026.02		27.6
Unfold-RL 2026.02		27.5
Cold-Start 2026.02		26.7
Zero-RL 2026.02		25.8
Unfold-RL 2026.02		25.8
Zero-RL 2026.02		25.5
Cold-Start 2026.02		23.8
Cold-Start 2026.02		23
Cold-Start 2026.02		19.2
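
The page does not define the metric in its title, so the following is only a minimal sketch of one common reading of "Pass@1 (Avg@32)": each problem is sampled 32 times, each sample is graded correct or incorrect, and the reported score is the per-problem fraction of correct samples averaged over all problems, as a percentage. The function name `pass_at_1_avg_k` and the input layout are hypothetical, chosen for illustration.

```python
from statistics import mean


def pass_at_1_avg_k(correct: list[list[bool]]) -> float:
    """Estimate Pass@1 by averaging over k samples per problem.

    correct[i][j] is True if sample j for problem i was graded correct.
    Assumption: every problem has at least one sample (k = 32 on this page).
    Returns a percentage in [0, 100].
    """
    if not correct:
        raise ValueError("expected at least one problem")
    # Fraction of correct samples for each problem.
    per_problem = [mean(1.0 if c else 0.0 for c in samples) for samples in correct]
    # Average the per-problem fractions and scale to a percentage.
    return 100.0 * mean(per_problem)


# Toy example: 2 problems with 4 samples each (not real benchmark data).
toy = [
    [True, False, False, False],  # 1 of 4 correct
    [True, True, False, False],   # 2 of 4 correct
]
print(pass_at_1_avg_k(toy))
```

Averaging over many samples reduces the variance of a single-sample Pass@1 estimate, which matters on a small benchmark where one problem shifts the score by several points.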