Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Comprehensive Examination on CMMLU (test)
Loading...
68.1
Accuracy
Qwen-14B-Chat
29.204
39.302
49.4
59.498
Mar 26, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen-14B-Chat
Parameter size group=1...
2024.03
68.1
InternLM2-Chat-20B-SFT
Parameter size group=1...
2024.03
65.3
InternLM2-Chat-20B
Parameter size group=1...
2024.03
65.1
InternLM2-Chat-7B-SFT
Parameter size group=<...
2024.03
63.2
InternLM2-Chat-7B
Parameter size group=<...
2024.03
63
Qwen-7B-Chat
Parameter size group=<...
2024.03
57.9
ChatGLM3-6B
Parameter size group=<...
2024.03
57.8
Baichuan2-13B-Chat
Parameter size group=1...
2024.03
54.8
GPT-3.5
Model type=API Models,...
2024.03
53.9
Baichuan2-7B-Chat
Parameter size group=<...
2024.03
53.4
Mixtral-8x7B-Instruct-v0.1
Parameter size group=1...
2024.03
50.6
Mistral-7B-Instruct-v0.2
Parameter size group=<...
2024.03
42
Llama2-13B-Chat
Parameter size group=1...
2024.03
33.8
Llama2-7B-Chat
Parameter size group=<...
2024.03
30.7
Feedback
Search any
task
Search any
task