Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Generation and Functional Correctness on HumanEval

206.79Output Throughput

Our approach

202.1308203.3404204.55205.7596Mar 5, 2026
Updated 2mo ago

Evaluation Results

MethodLinks
2026.03
206.792.2
2026.03
202.31-