| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Code Generation | HumanEval+ (test) | Pass@198.1 | 132 | |
| Code Generation | HumanEval+ v1 (test) | Pass Rate87.8 | 55 | |
| Code Reasoning | HumanEval+ | Pass@1697 | 12 | |
| Unit test generation | HumanEval+ (test) | Error Rate1.27 | 7 | |
| Code Generation | HumanEval+ | Score34.76 | 5 | |
| Code Generation | HumanEval+ ko | Score92.1 | 3 |