| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Locomotion | D4RL Cheetah Medium | Mean Return5,277.5 | 17 | |
| Offline-to-online Reinforcement Learning | D4RL Cheetah expert discretized | Online Normalized Score9.7 | 9 | |
| Offline-to-online Reinforcement Learning | D4RL Cheetah medium discretized | Online Score16.9 | 9 | |
| Locomotion | D4RL Cheetah Medium-Expert | Mean Return97.1 | 5 | |
| Locomotion | D4RL Cheetah Medium-Replay | Mean Return90.7 | 5 | |
| Locomotion | D4RL Cheetah Random | Mean Return77.1 | 5 | |
| Reinforcement Learning | D4RL Cheetah Medium-Expert | Mean Normalized Return98.7 | 5 |