Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Problem Solving on MATH (Sub-domain Breakdown)

0.362Overall Accuracy

DeepSeekMath-Base

0.019840.108670.19750.28633Feb 5, 2024Feb 13, 2024Feb 21, 2024Mar 1, 2024Mar 9, 2024Mar 17, 2024Mar 26, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.02
0.362-------
2024.02
0.336-------
2024.02
0.276-------
2024.03
0.255-------
2024.02
0.253-------
2024.03
0.251-------
2024.03
0.227-------
2024.03
0.202-------
2024.03
0.192-------
2024.02
0.181-------
2024.02
0.143-------
2024.02
0.141-------
2024.03
0.136-------
2024.03
0.134-------
2024.03
0.113-------
2024.03
0.101-------
2024.03
0.089-------
2024.03
0.055-------
2024.03
0.049-------
2024.03
0.033-------
2022.04
-0.0320.0360.0270.0240.0440.0520.013
2022.04
-0.0490.030.0150.0210.0650.0570.027