
Mathematical Reasoning on AIME 25 (Accuracy and Token Count)

AIME 25 Accuracy: 24.38

MUR

[Chart: AIME 25 Accuracy; date shown: Jul 20, 2025]
Updated 22d ago

Evaluation Results

Method    Links
2025.07    24.38    22,296
2025.07    20.213,113
2025.07    10.214,011