General Knowledge on CMMLU
Top result: GLM 4.6, Accuracy 88.4
[Chart: Accuracy over time, Dec 25, 2025 to Dec 30, 2025]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Accuracy (%) |
| --- | --- | --- | --- |
| GLM 4.6 | Evaluation Mode=Chat | 2025.12 | 88.4 |
| LongCat-Flash Exp-Chat | Evaluation Mode=Chat | 2025.12 | 87.5 |
| DeepSeek V3.2 | Evaluation Mode=Chat | 2025.12 | 87.3 |
| Qwen3-14B-Base | Backbone=Qwen3-14B-Bas... | 2025.12 | 84.92 |
| LPS-CPT | Backbone=Qwen3-14B-Bas... | 2025.12 | 84.79 |
| DOS-CPT | Backbone=Qwen3-14B-Bas... | 2025.12 | 84.78 |
| HPS-CPT | Backbone=Qwen3-14B-Bas... | 2025.12 | 84.53 |
| RS-CPT | Backbone=Qwen3-14B-Bas... | 2025.12 | 84.5 |
| LongCat-Flash Chat | Evaluation Mode=Chat | 2025.12 | 84.3 |