| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Tool Retrieval | APIGen | NDCG@10: 0.8575 | 44 |
| Argument Generation | APIGen sampled (test) | Argument F1 (2 calls): 88.1 | 15 |
| Tool Selection | APIGen sampled (test) | Tool Selection F1 (2 calls): 99.4 | 15 |
| Tool-Calling and Answer Generation | APIGen-MT (test) | Action Recall: 90.18 | 4 |
| Function Calling | APIGen (test) | Score (Single): 89.6 | 2 |