Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Reasoning on MMLU

95.1MMLU Accuracy

M2CL

63.972872.053980.13588.2161May 26, 2025Jul 8, 2025Aug 20, 2025Oct 3, 2025Nov 15, 2025Dec 28, 2025Feb 10, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
95.1-
2026.02
93.7-
2026.02
92.5-
2026.02
91.5-
2026.02
91.5-
2025.12
89-
2025.12
88.7-
2025.12
88.6-
2026.01
88.53,108.44
2026.02
88.2-
2025.12
87.9-
2025.12
87.9-
2025.12
87.8-
2025.12
87.4-
2025.12
87.4-
2025.12
87.1-
2026.02
87-
2026.02
86.8-
2025.12
86.7-
2026.02
86.3-
2025.12
86.2-
2025.12
86.2-
2025.12
85.7-
2025.12
85.7-
2026.02
85.6-
2026.02
85.6-
2026.01
85.56,753.47
2026.02
85-
2026.02
85-
2025.12
85-
2026.02
84.3-
2026.02
84.3-
2026.02
84.3-
2025.12
84.3-
2026.02
84.2-
2025.12
84-
2025.12
83.9-
2026.02
83.8-
2025.12
83.8-
2026.02
83.7-
2026.02
83.7-
2026.01
83.7-
2026.02
83-
2026.02
83-
2025.12
82.8-
2026.02
82.7-
2025.12
82.6-
2026.02
82.4-
2026.02
82.4-
2026.02
82.4-
2025.12
82.1-
2026.02
81.7-
2025.12
81.4-
2026.02
81-
2026.02
80.4-
2025.12
80.2-
2026.02
79.7-
2026.02
78.9-
2026.02
77.2-
2025.05
77.06-
2025.05
76.63-
2025.05
76.6-
2025.05
76.44-
2026.02
76.3-
2025.05
75.83-
2025.05
75.1-
2026.02
74.3-
2026.02
74.2-
2025.05
73.52-
2026.02
72.5-
2026.02
71.5-
2026.02
71.1-
2026.02
70.75-
2026.02
70.58-
2026.02
70.12-
2025.05
70.05-
2026.02
69.89-
2026.02
69.67-
2026.02
69.65-
2026.02
69.52-
2026.02
69.41-
2026.02
69.38-
2026.02
69.09-
2026.02
68.37-
2026.02
68.37-
2026.02
68.3-
2026.01
68.12,340.05
2026.01
67.64,988.84
2026.02
67.59-
2026.02
67.51-
2026.02
67.2-
2026.02
66.85-
2025.12
66.5-
2025.12
66.1-
2025.12
66-
2025.12
65.8-
2025.12
65.7-
2025.12
65.6-
2025.12
65.4-
2025.05
65.17-
Showing 100 of 126 rows