Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning on MMLU-Pro (Accuracy and Resource Usage)

40Avg Samples

SC

5.26414.28223.332.318Feb 10, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
40-5.756.62
2026.02
13.7-4.256.59
2026.02
11.11.2356.6
2026.02
9.8-5.356.59
2026.02
6.6-2.956.36