Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on MATH (test) (Accuracy + Time)

78.51Accuracy

Latent-GRPO

30.971643.313355.65567.9967Jan 13, 2026Jan 17, 2026Jan 22, 2026Jan 26, 2026Jan 31, 2026Feb 4, 2026Feb 9, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
78.51811.51
2026.01
77.531,081.47
2026.01
77.442,357.31
2026.01
65.771,608.34
2026.01
62.631,084.72
2026.01
58.47718.63
2026.01
55.63723.12
2026.01
52.941,224.15
2026.01
42.14814.22
2026.02
36.298,537
2026.02
34.2126,045
2026.02
32.8-
2026.02
32.8113,358