| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Sequential Recommendation | Tools (test) | HR@10: 4.91 | 12 |
| Generative Recommendation | Tools (Period 4) | H@5: 2.46 | 8 |
| Generative Recommendation | Tools (Period 3) | Hit Rate@5: 2.18 | 8 |
| Generative Recommendation | Tools (Period 2) | H@5: 1.81 | 8 |
| Generative Recommendation | Tools (Period 1) | Hit Rate@5: 2.26 | 8 |
| Task Planning | Tools PCD distribution (test) | Success Rate: 100 | 8 |
| Task Planning | Tools PCD (train) | Success Rate: 100 | 8 |
| Recommendation | Tools TIGER Backbone (Period 4) | H@5: 2.46 | 7 |
| Recommendation | Tools TIGER Backbone (Period 2) | H@5: 2.33 | 7 |
| Recommendation | Tools TIGER Backbone (Period 1) | H@5: 2.39 | 7 |
| Tool Use Accuracy | Seen Tools | SRt: 100 | 7 |
| Human Evaluation | Tools 100 pairs | Win Rate: 88 | 1 |