Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME (avg@16)

18.13Average Score (@16)

GRPO-SG

Updated 2mo ago

Evaluation Results

Method	Links
GRPO-SG 2025.10		18.13
GRPO-SG 2025.10		16.88
80/20 2025.10		16.4
GRPO 2025.10		15.63
AR 2025.10		15.52
Lopti 2025.10		15.45
Lopti 2025.10		15.27
80/20 2025.10		14.94
GRPO 2025.10		14.79
AR 2025.10		14.72
Qwen2.5-7B 2025.10		4.58