| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | Atari Pong | Mean Episode Return21 | 19 | |
| Offline Reinforcement Learning | Atari Pong | Episode Return18.8 | 6 | |
| Offline Reinforcement Learning | Atari Pong 1% DQN-replay v0 | Gamer-Normalized Score111.9 | 5 | |
| Imitation Learning | Atari Pong v4 (test) | Reward21 | 2 |