Comprehensive Examination on C-Eval (test)
Top result: Qwen-14B-Chat, 71.5 Accuracy (Mar 26, 2024)
Updated 4d ago
Evaluation Results
Method                       Details                      Date      Accuracy
Qwen-14B-Chat                Parameter size group=1...    2024.03   71.5
InternLM2-Chat-20B-SFT       Parameter size group=1...    2024.03   63.7
InternLM2-Chat-20B           Parameter size group=1...    2024.03   63
InternLM2-Chat-7B-SFT        Parameter size group=<...    2024.03   60.9
InternLM2-Chat-7B            Parameter size group=<...    2024.03   60.8
Qwen-7B-Chat                 Parameter size group=<...    2024.03   59.8
ChatGLM3-6B                  Parameter size group=<...    2024.03   59.1
Baichuan2-13B-Chat           Parameter size group=1...    2024.03   56.3
Mixtral-8x7B-Instruct-v0.1   Parameter size group=1...    2024.03   54
Baichuan2-7B-Chat            Parameter size group=<...    2024.03   53.9
GPT-3.5                      Model type=API Models,...    2024.03   52.5
Mistral-7B-Instruct-v0.2     Parameter size group=<...    2024.03   42.4
Llama2-13B-Chat              Parameter size group=1...    2024.03   35
Llama2-7B-Chat               Parameter size group=<...    2024.03   34.9