Share your thoughts, 1 month free Claude Pro on usSee more

Code Generation on HumanEval (Accuracy)

54.27Accuracy

Llama3.1-8B-Instruct

Updated 4mo ago

Evaluation Results

Method	Links
Llama3.1-8B-Instruct 2026.01		54.27
Qwen2.5-7B-Instruct 2026.01		52.44
LLaDA1.5-8B 2026.01		43.29
LLaDA1.5-8B 2026.01		40.85
LLaDA-8B-Instruct 2026.01		40.85
LLaDA1.5-8B 2026.01		39.63
LLaDA-8B-Instruct 2026.01		39.63
LLaDA-8B-Instruct 2026.01		39.63
LLaDA1.5-8B 2026.01		39.02
LLaDA-8B-Instruct 2026.01		39.02