Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical Reasoning on MMLU STEM (test)

82.8Accuracy

Qwen2.5-Math-72B

22.79238.37153.9569.529Oct 16, 2023Dec 11, 2023Feb 5, 2024Apr 2, 2024May 28, 2024Jul 23, 2024Sep 18, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.09
82.8
2024.09
79.9
2024.09
79.1
2024.09
78.1
2024.09
67.8
2024.09
67.6
2024.09
65.7
2023.10
63.9
2024.09
63
2024.09
59.5
2024.09
56.5
2023.10
53.9
2024.09
53.1
2024.09
51.3
2024.09
50.4
2023.10
49
2024.09
44.8
2023.10
40.5
2023.10
37.7
2023.10
35.6
2023.10
29.9
2023.10
25.1