| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Regret Minimization | KL-regularized Bandits | Sample Complexity2 | 2 | |
| Regret Minimization | KL-regularized Bandits Data Coverage | Regret2 | 1 | |
| Regret Minimization | KL-regularized Bandits Preference w/ Linear Reward | Regret2 | 1 | |
| Regret Minimization | KL-regularized Bandits Eluder Dimension | Metric- | 0 | |
| Regret Minimization | KL-regularized Bandits Preference w/ Eluder Dimension | Metric- | 0 |