| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Policy Optimization | Multi-Armed Bandits | Sample Complexity-7 | 8 | |
| Regret minimization | Multi-Armed Bandits (MABs) Stochastic i.i.d. setting | Metric- | 0 | |
| Multi-Armed Bandits | Static Unconstrained Multi-Armed Bandits (MAB) | Metric- | 0 |