| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Offline Reinforcement Learning | Gym Locomotion (Medium) | HalfCheetah Score66.8 | 14 | |
| Offline Reinforcement Learning | Gym Locomotion (Medium-Replay) | HalfCheetah Return45.4 | 8 | |
| Offline Reinforcement Learning | Gym Locomotion Medium-Expert | HalfCheetah Score95 | 8 | |
| Locomotion | Gym-Locomotion | Ant Score8,509 | 5 |