Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 25 (avg@16)

25.3Avg@16

Scaf-GRPO

Updated 4mo ago

Evaluation Results

Method	Links
Scaf-GRPO 2025.10		25.3
Vanilla GRPO 2025.10		22.9
Scaf-GRPO 2025.10		14.6
LUFFY 2025.10		12
Oat-Zero 2025.10		11.5
SimpleRL-Zero 2025.10		11
Scaf-GRPO 2025.10		11
Scaf-GRPO 2025.10		10.8
Vanilla GRPO 2025.10		10.8
Vanilla GRPO 2025.10		9.5
Vanilla GRPO 2025.10		8.2
Scaf-GRPO 2025.10		0.3
Vanilla GRPO 2025.10		0