Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on MGSM-zh (test)

79.6Accuracy

DeepSeekMath-RL

39.66450.03260.470.768Feb 5, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.02
79.6
2024.02
78.4
2024.02
76.4
2024.02
74
2024.02
73.2
2024.02
72
2024.02
66.4
2024.02
64.8
2024.02
64.8
2024.02
41.2