| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Tool Use | ToolAlpaca | Tool Use Success Rate | 77.9 | 26 |
| Tool-use Inference | ToolAlpaca | Match Rate | 5.26 | 22 |
| Tool-use reasoning | ToolAlpaca | Accuracy | 66.73 | 20 |
| Tool selection | ToolAlpaca | Accuracy | 97.42 | 20 |
| Tool usage simulation | ToolAlpaca evaluation | Procedure Score | 78.38 | 12 |