Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 2024 (reference set)

33.3Pass@1

ScaleBiO

Updated 5mo ago

Evaluation Results

Method	Links
ScaleBiO 2024.06		33.3	33.3
RHO-LOSS 2024.06		30	33.3
Uniform 2024.06		26.7	33.3
LESS 2024.06		26.7	33.3