Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Next-token reasoning on OMNI-MATH Easy (val)

76.89Accuracy

LoopRPT

8.052425.923743.79561.6663Mar 20, 2026
Updated 26d ago

Evaluation Results

MethodLinks
2026.03
76.894
2026.03
76.072.05
2026.03
75.384
2026.03
752.5
2026.03
74.624
2026.03
74.513.24
2026.03
74.43.34
2026.03
74.334
2026.03
47.49-
2026.03
10.7-