Share your thoughts, 1 month free Claude Pro on usSee more

Python Coding on HumanEval v1 (test)

56.1Pass@1

InternLM2-7B

Updated 3mo ago

Evaluation Results

Method	Links
InternLM2-7B 2024.03		56.1
InternLM2-20B 2024.03		48.8
ChatGLM3-6B-Base 2024.03		45.1
Qwen-7B-Chat 2024.03		36
Mixtral-8x7B-v0.1 2024.03		32.3
InternLM2-20B-Base 2024.03		32.3
InternLM2-7B-Base 2024.03		31.1
Qwen-14B 2024.03		30.5
Mistral-7B-v0.1 2024.03		27.4
Baichuan2-13B-Base 2024.03		23.2
Baichuan2-7B-Base 2024.03		22
Llama2-13B 2024.03		18.9
Llama2-7B 2024.03		14.6