Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on MGSM-zh (test)

89.6Accuracy

JT-Safe-V2-35B

Updated 2mo ago

Evaluation Results

Method	Links
JT-Safe-V2-35B 2026.05		89.6
SOTA with Equivalent Parameters 2026.05		89.2
DeepSeekMath-RL 2024.02		79.6
DeepSeekMath-RL 2024.02		78.4
DeepSeek-LLM-Chat 2024.02		76.4
DeepSeek-LLM-Chat 2024.02		74
DeepSeekMath-Instruct 2024.02		73.2
DeepSeekMath-Instruct 2024.02		72
MetaMath 2024.02		66.4
SeaLLM-v2 2024.02		64.8
WizardMath-v1.0 2024.02		64.8
ToRA 2024.02		41.2