
Comprehensive Examination on C-Eval (test)

Accuracy: 71.5

Qwen-14B-Chat

Updated 4mo ago

Evaluation Results

Method	Date	Accuracy (%)
Qwen-14B-Chat	2024.03	71.5
InternLM2-Chat-20B-SFT	2024.03	63.7
InternLM2-Chat-20B	2024.03	63
InternLM2-Chat-7B-SFT	2024.03	60.9
InternLM2-Chat-7B	2024.03	60.8
Qwen-7B-Chat	2024.03	59.8
ChatGLM3-6B	2024.03	59.1
Baichuan2-13B-Chat	2024.03	56.3
Mixtral-8x7B-Instruct-v0.1	2024.03	54
Baichuan2-7B-Chat	2024.03	53.9
GPT-3.5	2024.03	52.5
Mistral-7B-Instruct-v0.2	2024.03	42.4
Llama2-13B-Chat	2024.03	35
Llama2-7B-Chat	2024.03	34.9