Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Next-token reasoning on OMNI-MATH Hard (val)

38.1Accuracy

LoopRPT

6.213614.491822.7731.0482Mar 20, 2026
Updated 26d ago

Evaluation Results

MethodLinks
2026.03
38.14
2026.03
37.242.28
2026.03
34.823.07
2026.03
34.744
2026.03
34.524
2026.03
34.353.51
2026.03
33.913.75
2026.03
33.794
2026.03
19.19-
2026.03
7.44-