Code Generation on HumanEval compile (L0)

Pass@1: 26.95

ShieldedCode

Updated 5mo ago

Evaluation Results

Method	Links	Pass@1	(unlabeled)
ShieldedCode 2026.01		26.95	35.68
GPT-4o 2026.01		22.58	31.47
DeepSeekCoder-7B 2026.01		10.28	14.23
CodeLlama 2026.01		7.84	9.21
GPT-3.5-Turbo 2026.01		6.89	10.18
Meta LLMCompiler-7B 2026.01		6.42	7.64
StarCoder2-7B 2026.01		5.78	9.45
Qwen-2.5-Coder-7B 2026.01		5.31	7.12
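
For readers unfamiliar with the metric, Pass@1 is usually reported with the unbiased pass@k estimator introduced alongside HumanEval: draw n samples per problem, count the c that pass, and compute 1 - C(n-c, k) / C(n, k), averaged over problems. The sketch below assumes this leaderboard follows that convention, which the page does not state. The function names `pass_at_k` and `benchmark_score` are hypothetical, and the sample counts in the usage note are illustrative only, not taken from the table.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: samples generated, c: samples that passed, k: sampling budget.
    Assumption: this is the standard HumanEval estimator, not necessarily
    the exact procedure used for the scores listed above.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # Probability that a random size-k subset contains no passing sample.
    all_fail = comb(n - c, k) / comb(n, k)
    return 1.0 - all_fail


def benchmark_score(results: list[tuple[int, int]], k: int) -> float:
    """Mean pass@k over (n, c) pairs, expressed as a percentage."""
    if not results:
        raise ValueError("results must not be empty")
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

For example, `benchmark_score([(10, 3), (10, 0)], k=1)` averages a problem with 3 of 10 passing samples and a problem with none, giving a score of about 15.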