| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-armed Bandit Regret Minimization | K-armed bandits | Minimax Ratio1 | 3 | |
| Regret minimization | K-armed bandits Exponential Family rewards | Finite-Time Regret (Minimax Ratio)1 | 2 | |
| Regret minimization | K-armed bandits [0, 1] rewards | Finite-Time Regret (Minimax Ratio)1 | 2 | |
| Regret minimization | K-armed bandits Gaussian rewards | Finite-Time Regret (Minimax Ratio)1 | 1 | |
| Regret minimization | K-armed bandits sub-Gaussian rewards | Metric- | 0 |