| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Near-optimal policy identification | MDP | Metric- | 0 | |
| Transfer Reinforcement Learning | MDP with Tucker rank (d, d, d) | Metric- | 0 | |
| Transfer Reinforcement Learning | MDP with Tucker rank (S, S, d) | Metric- | 0 | |
| Compute epsilon-optimal policy | MDP sample setting | Metric- | 0 |