| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Unregularized Reinforcement Learning | Tabular MDP Finite State Action Spaces | Sample Complexity1 | 3 | |
| Entropy-Regularized Reinforcement Learning | Tabular MDP Finite State Action Spaces | Sample Complexity1 | 2 | |
| Regret minimization in Reinforcement Learning | Tabular MDP | Cumulative Regret3 | 1 | |
| Reinforcement Learning | Tabular MDP | Sample Complexity0.5 | 1 |