Mathematical Reasoning on AIME 2024 (Reward-weighted Pass@1)

3.45 Reward-weighted Pass@1

SFT

[Chart: Reward-weighted Pass@1 over time; y-axis ticks 0.7564, 1.4557, 2.155, 2.8543; x-axis label Oct 2, 2025]
Updated 1mo ago

Evaluation Results

Date      Reward-weighted Pass@1
2025.10   3.45
2025.10   3.43
2025.10   3.41
2025.10   3.31
2025.10   3
2025.10   2.59
2025.10   1.21
2025.10   1.11
2025.10   1.05
2025.10   0.86