| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Code Translation | CodeLingua Java → Fortran | Pass@158.4 | 6 | |
| Code Translation | CodeLingua Python → C | Pass@1 Accuracy86 | 6 | |
| Code Translation | CodeLingua Java → C | Pass@194 | 6 | |
| Reasoning failure prediction | CodeLingua (L3) | Accuracy76 | 2 | |
| Reasoning failure prediction | CodeLingua (L2) | Accuracy75 | 2 | |
| Reasoning failure prediction | CodeLingua (L1) | Accuracy73 | 2 |