Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Math Problem Solving on MMATH (test)

27.1Accuracy (Ar)

Qwen2.5-7B-Instruct + Vanilla GRPO

11.81215.78119.7523.719May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
27.127.431.625.528.331.128.528.731.327.930.229.528.9
2026.05
26.528.631.228.228.432.129.230.125.232.431.329.829.4
2026.05
24.720.527.826.5263126.125.92529.928.627.326.6
2026.05
24.323.628.22528.230.226.625.22529.227.926.926.7
2026.05
22.72122.222.123.932.624.121.721.222.925.322.823.6
2026.05
21.420.722.724.525.728.623.924.622.826.726.725.224.4
2026.05
21.219.926.423.623302421.623.124.325.523.623.9
20.321.524.125.423.828.223.920.824.624.125.123.623.8
2026.05
19.118.724.720.426.127.822.82319.623.72422.622.7
2026.05
18.819.123.218.822.429.62219.92123.62321.921.9
2026.05
18.615.719.518.519.422.4191620.420.119.118.919
2026.05
16.215.517.816.317.522.917.716.415.520.119.517.917.8
2026.05
15.114.820.715.520.223.118.219.615.120.919.618.818.5
2026.05
1515.318.212.217.119.916.316.314.518.616.716.516.4
2026.05
13.713.316.512.515.921.315.513.1111616.714.215
13.11116.713.715.12315.415.91318.516.61615.7
2026.05
12.912.118.513.31722.41613.412.116.519.115.315.7
2026.05
12.811.619.711.915.522.415.71512.217.217.715.515.6
2026.05
12.612.61712.717.423.215.914.912.517.817.915.815.8
2026.05
12.411.417.213.217.922.315.714.112.316.818.615.515.6