Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Knowledge Reasoning on MMLU-Pro

84.6Accuracy

DeepSeek v3.2

48.470457.850267.2376.6098Dec 18, 2025Dec 25, 2025Jan 1, 2026Jan 8, 2026Jan 15, 2026Jan 22, 2026Jan 29, 2026
Updated 3d ago

Evaluation Results

MethodLinks
84.6--
2025.12
83.5--
2025.12
83.1--
2025.12
81.9--
2026.01
76.0211
2026.01
75.840.230.49
2026.01
75.720.472.32
2026.01
75.370.381.92
75.3--
73.9--
2026.01
73.40.593.88
2026.01
72.20.220.4
2026.01
72.150.584.99
2026.01
71.9811
2026.01
71.90.342.23
2026.01
71.720.32.36
2026.01
71.140.351.79
2026.01
70.380.392.97
2025.12
67.1--
2026.01
60.1811
2026.01
60.10.230.39
2026.01
60.030.613.21
2026.01
59.880.322.03
2026.01
59.880.281.78
2026.01
59.590.43.74
2026.01
52.630.352.16
2026.01
52.580.313.17
2026.01
52.5411
2026.01
52.370.220.42
2026.01
520.262.83
2026.01
49.860.634.16