Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME25 (Pass@1 (Avg@32))

28.3Pass@1 (Avg@32)

Mix-RL

Updated 5mo ago

Evaluation Results

Method	Links
Mix-RL 2026.02		28.3
Mix-RL 2026.02		28
Unfold-RL 2026.02		27.8
Fold-RL 2026.02		27.8
Fold-RL 2026.02		26.9
Unfold-RL 2026.02		26.7
Cold-Start 2026.02		25.4
Unfold-RL 2026.02		25.1
Unfold-RL 2026.02		25
Cold-Start 2026.02		24.6
Cold-Start 2026.02		23.1
Zero-RL 2026.02		22.5
Cold-Start 2026.02		22
Zero-RL 2026.02		18.1