Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Knowledge Assessment on C-Eval

92.5Accuracy

Kimi-K2 Base

20.2238.98557.7576.515Jun 19, 2024Oct 5, 2024Jan 22, 2025May 11, 2025Aug 28, 2025Dec 15, 2025Apr 3, 2026
Updated 13d ago

Evaluation Results

MethodLinks
2026.01
92.5
2026.04
91.5
2026.01
91
2026.01
90
2026.04
89.2
2026.04
88.7
2026.01
87.9
2026.04
87.8
2024.06
84.9
2024.06
83.8
2025.12
79.5
2025.12
79.3
2025.12
78.8
2024.07
78.68
2025.12
78.5
2025.12
77
2026.04
77
2024.06
76
2025.12
75.9
2024.06
75.1
2024.06
74.4
2024.06
73.9
2024.06
70.2
2024.07
68.57
2024.07
68.35
2024.07
68.2
2024.07
68.05
2024.07
67.6
2026.01
63.3
2026.01
62.7
2024.06
61.6
2024.06
60.2
2026.01
58
2026.02
54.27
2026.02
48.67
2026.02
47.6
2026.01
46.9
2024.06
38.2
2024.06
38.2
2024.06
36.2
2026.02
35.98
2024.06
35.2
2024.06
34.5
2024.06
24.2
2025.12
23
2025.12
23