| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Contextual Bandit | Contextual Bandit Theoretical Bounds | Regret Scaling1 | 6 |
| Regret Minimization | K-armed contextual bandit Logistic reward (theoretical bound) | Metric- | 0 |
| Contextual Bandit | Contextual Bandit | Metric- | 0 |