Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation on HumanEval (Accuracy)

54.27Accuracy

Llama3.1-8B-Instruct

38.4142.527546.64550.7625Jan 30, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.01
54.27
2026.01
52.44
2026.01
43.29
2026.01
40.85
2026.01
40.85
2026.01
39.63
2026.01
39.63
2026.01
39.63
2026.01
39.02
2026.01
39.02