
General Reasoning on MMLU-P

Accuracy: 75.6

Instruct

[Chart: accuracy (y-axis: 54.176, 59.738, 65.3, 70.862) by date (x-axis: Jan 31, 2026)]
Updated 3d ago

Evaluation Results

Date      Accuracy
2026.01   75.6
2026.01   75.6
2026.01   74.2
2026.01   74.2
2026.01   73.8
2026.01   72.4
2026.01   72.2
2026.01   71.4
2026.01   71.4
2026.01   71.2
2026.01   70.8
2026.01   70.8
2026.01   68.6
2026.01   68.2
2026.01   67.8
2026.01   67
2026.01   65.8
2026.01   65.6
2026.01   63.2
2026.01   63
2026.01   61.8
2026.01   60.8
2026.01   56.8
2026.01   55