| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| ToolBench Average over all sets | GPT4 TOPGUN | Win Rate86.54 | 13 | 4d ago | |
| ToolBench G3-Instruction | GPT4 TOPGUN | Win Rate0.9368 | 13 | 4d ago | |
| ToolBench G2-Category | GPT4 TOPGUN | Win Rate78.78 | 13 | 4d ago | |
| ToolBench G2-Instruction | GPT4 TOPGUN | Win Rate87.59 | 13 | 4d ago |