Code Generation and Functional Correctness on HumanEval

206.79 Output Throughput

Our approach

Updated 4mo ago

Evaluation Results

Method	Links	Output Throughput
Our approach 2026.03		206.79	2.2
128k-vocab 2026.03		202.31	-