| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Code Generation | LCB v6 | Accuracy 57.1 | 49 |
| Code Generation | LCB v5 | Accuracy 61.5 | 45 |
| Code Generation | LCB v6 | Pass@1 50.68 | 39 |
| Code Generation | LCB | Speedup 7.81 | 33 |
| Code Reasoning | LCB v6 | Accuracy 53.6 | 26 |
| Code Generation | LCB | Accuracy 85.5 | 26 |
| Code Reasoning | LCB | pass@1 62.46 | 26 |
| Code | LCB v6 | Score 87.7 | 20 |
| Code reasoning | LCB v5 | Accuracy 33.72 | 16 |
| Code Generation | LCB | Throughput 66.41 | 16 |
| Code Generation | LCB v6 (val) | Accuracy (%) 78 | 13 |
| Code | LCB | Speedup 8.37 | 12 |
| Code Generation | LCB | Pass@1 30.1 | 11 |
| Code Generation | LCB v5 | pass@1 49.5 | 11 |
| Code Generation | LCB 2408-2505 | Pass@1 74.6 | 11 |
| Code Generation | LCB (test) | Accuracy 21.69 | 10 |
| Code Generation | LCB v5 | Score 37.63 | 9 |
| Reasoning | LCB | Score 59.58 | 9 |
| Code Generation | LCB-Hard (171) | Accuracy 62.6 | 8 |
| Code-reasoning | LCB v6 | Pass@1 7.7 | 8 |
| Code Generation | LCB-IO | Pass@1 81.7 | 8 |
| Code | LCB Pro Med 25Q2 | pass@1 10.5 | 7 |
| Code | LCB 08/24-02/25 v5 | pass@1 77.5 | 7 |
| Coding | LCB v6 | Pass@1 31.43 | 6 |
| Coding | LCB v5 | Pass@1 58.42 | 6 |