
Mathematical Reasoning on AMC-23 (Accuracy, Tokens)

Top result: LAPO-I (Accuracy 78.3)

Updated 3mo ago

Evaluation Results

Method	Date	Accuracy	Tokens
LAPO-I	2026.03	78.3	3,765
LAPO-D	2026.03	77.6	3,655
ThinkPrune-4k	2026.03	76.3	3,839
ThinkPrune-I2k	2026.03	74.3	2,913
DeepScaler-1.5B	2026.03	74.2	6,416
HAPO	2026.03	70.3	4,301
AutoThink	2026.03	67.8	3,658
Thinkless	2026.03	65.7	5,276
STRATAGEM	2026.04	60	-
Qwen3-4B-Base	2026.04	50	-
SPIRAL	2026.04	45	-