| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Tool-use decision-making | MetaTool (test) | Decision Accuracy90.2 | 38 | |
| Tool selection | MetaTool similar choices subtask (test) | Accuracy83.4 | 8 | |
| Adaptive Tool Use | MetaTool | Tool Invocations Count520 | 8 | |
| Tool Selection | MetaTool 199 tools, 1,287 queries (30% test) | R@183 | 7 | |
| Tool Selection Attack | MetaTool (test) | TDR97.2 | 3 |