Share your thoughts, 1 month free Claude Pro on usSee more

Next-token reasoning on OMNI-MATH Hard (val)

38.1Accuracy

LoopRPT

Updated 4mo ago

Evaluation Results

Method	Links
LoopRPT 2026.03		38.1	4
LoopRPT 2026.03		37.24	2.28
LoopRPT 2026.03		34.82	3.07
LoopRPT 2026.03		34.74	4
Peak 2026.03		34.52	4
Adap. 2026.03		34.35	3.51
Adap. 2026.03		33.91	3.75
Peak 2026.03		33.79	4
Vanilla 2026.03		19.19	-
+CoT 2026.03		7.44	-