Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-task Language Understanding on MMLU (MMLU Score)

86.4MMLU Score

GPT-4

21.60838.42955.2572.071Jan 29, 2026Feb 2, 2026Feb 6, 2026Feb 10, 2026Feb 14, 2026Feb 18, 2026Feb 23, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
86.4
2026.01
70
2026.02
66.2
2026.02
63.7
2026.02
63
2026.02
62.7
2026.02
62.6
2026.02
62.3
2026.02
52.9
2026.02
52.3
2026.01
52.1
2026.02
51.7
2026.02
51.4
2026.02
51
2026.02
50.8
2026.01
48.1
2026.02
47.4
2026.01
47
2026.02
43.5
2026.02
39.5
2026.02
37.7
2026.02
34.5
2026.02
33.6
2026.02
25.8
2026.02
25.7
2026.02
25.1
2026.02
24.7
2026.02
24.1