Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 2025 (reference set)

26.7pass@1

ScaleBiO

Updated 5mo ago

Evaluation Results

Method	Links
ScaleBiO 2024.06		26.7	36.7
Uniform 2024.06		20	33.3
LESS 2024.06		20	36.7
RHO-LOSS 2024.06		20	33.3