| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Intent Alignment | ToolACE | Aintent (GPT-5.0)85.71 | 6 | |
| Intent Inversion Attack | ToolACE | S_text81.39 | 6 | |
| Tool Calling | ToolACE multi-turn (test) | Accuracy61.64 | 2 | |
| Tool Identification | ToolACE multi-turn (test) | Accuracy75.34 | 2 | |
| Function Calling | ToolAce (test) | Accuracy (Single)84.4 | 2 |