| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| BFCL Multi-Turn v3 | APIGen-MT | Overall Score69.1 | 14 | 4d ago | |
| API-Bank | GenEnv | Success Rate79.1 | 12 | 4d ago | |
| MINT-Bench | LLAMA PRO - INSTRUCT | Success Rate (Turn 1)9.85 | 5 | 4d ago | |
| General Tool-Augmented LLM Capabilities Qualitative Comparison Survey | - | - | 0 | 4d ago |