| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Interactive Decision Making | InterCode NL2Bash | Success Rate79.6 | 40 | |
| interactive SQL querying | InterCode SQL | Avg Reward69.1 | 10 | |
| SQL Code Generation | InterCode SQL | Success Rate7.3 | 7 | |
| Bash Command Execution | InterCode Bash | Execution Success Rate72 | 4 |