Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 2025 (Mean@32 accuracy)

66Mean@32 Accuracy

HTPO

Updated 2mo ago

Evaluation Results

Method	Links
HTPO 2026.05		66
DAPO 2026.05		64.7
GSPO 2026.05		63.7
SAPO 2026.05		63.6
80/20-Rule 2026.05		63.4
BAPO 2026.05		59.2
GRPO† 2026.05		58.6
HTPO 2026.05		30.4
GSPO 2026.05		30.3
SAPO 2026.05		28.4
80/20-Rule 2026.05		25.9
BAPO 2026.05		24.7
DAPO 2026.05		23.7
GRPO† 2026.05		22.9