| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Tool Learning under Instructions Beyond Tool Capabilities | NoisyToolBench IBTC 1.0 (test) | A1 Score98 | 32 | |
| Tool Learning under Instruction with Error | NoisyToolBench IwE 1.0 (test) | A1 Success Rate74 | 32 | |
| Tool Learning under Instruction with Multiple Requests | NoisyToolBench IMR 1.0 (test) | A1 Score90 | 32 | |
| Tool Learning under Instruction with Missing Key Information | NoisyToolBench IMKI 1.0 (test) | A1 Success Rate94 | 32 | |
| Tool-using | NoisyToolBench IBTC | Average Steps1 | 32 | |
| Tool-using | NoisyToolBench IwE | Average Steps1.3 | 32 | |
| Tool-using | NoisyToolBench IMR | Average Steps1.03 | 32 | |
| Tool-using | NoisyToolBench IMKI | Average Steps1.26 | 32 |