
Mathematical Reasoning on AIME24 (Total inference runtime)

[Chart: Total Inference Runtime (mm:ss); series: S-GRPO; Apr 3, 2026]
Updated 12d ago

Evaluation Results

Method | Links
2026.04 | 0
2026.04 | 0
2026.04 | 0
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 1
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 2
2026.04 | 3
2026.04 | 3
2026.04 | 4
2026.04 | 4
2026.04 | 4
2026.04 | 4
2026.04 | 5
2026.04 | 5
2026.04 | 5
2026.04 | 6
2026.04 | 6