| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | Gym-MuJoCo Walker2D | Average Return4,909 | 10 | |
| Offline Reinforcement Learning | Gym-MuJoCo Aggregate | Aggregate Score77.24 | 6 | |
| Offline Reinforcement Learning | Gym-MuJoCo Full-replay | HalfCheetah Return84.3 | 6 | |
| Offline Reinforcement Learning | Gym-MuJoCo Expert | HalfCheetah105.9 | 6 | |
| Offline Reinforcement Learning | Gym-MuJoCo Random | HalfCheetah30.9 | 6 | |
| Continuous Control | Gym MuJoCo Locomotion | TD3 Normalized Score1 | 5 |