| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-Agent Inventory Management | Inventory Management Average (test) | Avg Relative Gap (Δ)134.53 | 6 | |
| Multi-Agent Inventory Management | Inventory Management Inc-Uni (test) | Relative Gap151.51 | 6 | |
| Multi-Agent Inventory Management | Inventory Management Inc-Div (test) | Relative Gap (Δ)104.13 | 6 | |
| Multi-Agent Inventory Management | Inventory Management Dec-Uni (test) | Relative Gap177.78 | 6 | |
| Multi-Agent Inventory Management | Inventory Management Dec-Div (test) | Relative Gap (Δ)48.8 | 6 | |
| Multi-Agent Inventory Management | Inventory Management Const-Uni (test) | Relative Gap (Delta)0 | 6 | |
| Inventory Management | Inventory Management Average GPT-5 (all scenarios) | Relative Gap93.77 | 6 | |
| Inventory Management | Inventory Management Inc-Uni GPT-5 (increasing-uniform) | Relative Gap (Δ)81.2 | 6 | |
| Inventory Management | Inventory Management increasing-diverse GPT-5 | Relative Gap (Δ)102.48 | 6 | |
| Inventory Management | Inventory Management decreasing-uniform GPT-5 | Relative Gap (Δ)173.33 | 6 | |
| Inventory Management | Inventory Management decreasing-diverse GPT-5 | Relative Gap (Δ)31.33 | 6 | |
| Inventory Management | Inventory Management Constant-Uniform GPT-5 | Relative Gap (Δ)0 | 6 | |
| Inventory Management | Inventory Management (test) | Const Uni103.33 | 6 | |
| Generative Modeling | inventory management dataset | KS0.0132 | 4 |