Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on AIME24 (Average Accuracy)

23.5Average Accuracy

GRPO

11.0214.2617.520.74Dec 1, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
23.5
2025.12
21
2025.12
11.5