Share your thoughts, 1 month free Claude Pro on usSee more

Python Coding on HumanEval (test)

67.7Accuracy

InternLM2-Chat-20B

Updated 3mo ago

Evaluation Results

Method	Links
InternLM2-Chat-20B 2024.03		67.7
InternLM2-Chat-20B-SFT 2024.03		67.1
InternLM2-Chat-7B-SFT 2024.03		61.6
InternLM2-Chat-7B 2024.03		59.2
ChatGLM3-6B 2024.03		53.1
Qwen-14B-Chat 2024.03		41.5
Qwen-7B-Chat 2024.03		36
Mistral-7B-Instruct-v0.2 2024.03		35.4
Mixtral-8x7B-Instruct-v0.1 2024.03		32.3
Baichuan2-13B-Chat 2024.03		19.5
Baichuan2-7B-Chat 2024.03		17.7
Llama2-7B-Chat 2024.03		15.2
Llama2-13B-Chat 2024.03		8.5