| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-Agent Reinforcement Learning | CN rac-dist | Mean Episodic Reward888 | 21 | |
| Multi-Agent Reinforcement Learning | CN rdist | Mean Episodic Reward-161 | 21 | |
| Multi-Agent Reinforcement Learning | CN rdete | Mean Episodic Reward-154 | 21 | |
| Soft Query Answering | CN15k | 1P Score16.6 | 6 | |
| Backdoor Attack | CN (test) | Runtime (s)28.3 | 4 | |
| Intent Prediction | CN | Accuracy55.2 | 4 | |
| Function Invocation | CN Ver. Dual | Token Usage1,377.9 | 3 | |
| Function Invocation | CN (Single) | Invocation Accuracy0.89 | 3 |