| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Code Generation | HumanEval and MBPP | HumanEval Score95.1 | 59 | |
| Code Generation | HumanEval and MBPP EvalPlus | HumanEval+ Pass@k70.1 | 29 | |
| Code Generation | HumanEval+ and MBPP+ | Score73.7 | 4 | |
| Code-writing | HumanEval & MBPP EvalPlus (test) | HumanEval Pass Rate39.02 | 4 |