Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AMC 23 (Avg@16, #Tokens)

93Average Accuracy @16

GR3

85.51287.45689.491.344Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
933,090
2026.03
91.42,255
2026.03
90.37,256
2026.03
89.86,385
2026.03
88.12,427
2026.03
88.14,280
2026.03
85.82,963