Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Next-token Reasoning on OMNI-MATH Medium (val)

61.15Accuracy (Next-token Reasoning)

LoopRPT

6.49820.686534.87549.0635Mar 20, 2026
Updated 26d ago

Evaluation Results

MethodLinks
2026.03
61.154
2026.03
60.212.18
2026.03
58.294
2026.03
57.722.81
2026.03
57.353.35
2026.03
57.284
2026.03
57.23.53
2026.03
57.194
2026.03
32.18-
2026.03
8.6-