
Language Understanding on CMMLU

90.1 Accuracy

Qwen2-72B

[Chart: accuracy over time, Jan 11, 2024 to Mar 19, 2026]
Updated 29d ago

Evaluation Results

Method    Links
2024.07    90.1     -
2024.07    88.5     -
           88.3     -
2024.07    84.8     -
           83.5     -
2024.07    82.3     -
2026.03    80.44    -
2026.03    80.44    -
2026.03    80.19    -
2026.03    80.15    -
2026.03    79.6     -
2026.03    78.94    -
2026.01    76.8     -
2026.01    75.12    -
2026.01    71.35    -
2024.07    70.3     -
2026.01    70.25    -
2026.01    69.5     -
           67.2     -
2026.01    65.42    -
2024.07    57.8     -
2024.07    57.34    -
2024.07    55.1     -
           53.4     -
2024.07    51.2     -
2024.07    51.03    -
2026.03    50.53    -
2026.03    50.44    -
2026.03    50.44    -
2026.03    49.47    -
2026.03    49.4     -
2026.03    48.22    -
2024.01    47.2     -
2024.07    45.9     -
2024.07    43.72    -
2024.01    42.5     -
2024.07    40.89    -
2026.02    36.58    -
2026.02    36.52    -
2026.02    35.89    -
2024.07    25.53    -
2024.07    24.2     -
2023.08    -        26.8
2023.08    -        31.8
2023.08    -        44.4
2023.08    -        57.1
2023.08    -        48.8
2023.08    -        51.8
2023.08    -        62.2
2023.08    -        49.5
2024.01    -        36.9
2024.01    -        51.2
2024.01    -        49.3
2024.01    -        40.6
2024.01    -        25.4
2024.01    -        35.9
           -        31.9
2024.01    -        32.6
2024.01    -        42.5
2026.02    -        34.64
2026.02    -        35.16
2026.02    -        35.17
2026.02    -        34.22
2026.02    -        33.4
2026.02    -        34.5
2026.02    -        32.73
2026.02    -        33.07
2026.02    -        32.91