Mathematical Reasoning on AIME 24 (Avg@32, #Token)

45.2 Average Score (Top-32)

GR³

[Chart: Average Score, axis ticks 22.632, 28.491, 34.35, 40.209; Mar 11, 2026]
Updated 1mo ago

Evaluation Results

Date      Avg@32   #Token
2026.03   45.2      8,381
2026.03   39.6     13,054
2026.03   34.3      3,839
2026.03   34.2      9,204
2026.03   30.1      5,770
2026.03   30       16,531
2026.03   23.5      9,071