Python Coding on HumanEval-X v1 (test)

48.2 Pass@1

InternLM2-20B

Updated 3mo ago

Evaluation Results

Method	Links	Pass@1
InternLM2-20B 2024.03		48.2
InternLM2-7B 2024.03		39.6
ChatGLM3-6B-Base 2024.03		38.3
Mixtral-8x7B-v0.1 2024.03		38.3
InternLM2-20B-Base 2024.03		31.5
Qwen-14B 2024.03		31
InternLM2-7B-Base 2024.03		28.8
Mistral-7B-v0.1 2024.03		28.5
Qwen-7B-Chat 2024.03		24.4
Baichuan2-13B-Base 2024.03		19.5
Llama2-13B 2024.03		17.2
Baichuan2-7B-Base 2024.03		16.1
Llama2-7B 2024.03		11.2