| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Code Generation | HumanEval+ (test) | Pass@181.7 | 81 | |
| Code Generation | HumanEval+ v1 (test) | Pass Rate87.8 | 41 | |
| Unit test generation | HumanEval+ (test) | Error Rate1.27 | 7 | |
| Code Reasoning | HumanEval+ | Average Score @1682.29 | 6 | |
| Code Generation | HumanEval+ ko | Score92.1 | 3 |