| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Continuous Control | MuJoCo Walker2d v4 | Normalized Performance: 13,060 | 39 |
| Offline Reinforcement Learning | MuJoCo Walker2d Friction shift | Normalized Score: 76.96 | 32 |
| Offline Reinforcement Learning | MuJoCo Walker2d Gravity shift | Normalized Score: 69.48 | 32 |
| Offline Reinforcement Learning | MuJoCo walker2d medium-replay D4RL | Normalized Return: 94.1 | 20 |
| Offline Reinforcement Learning | MuJoCo walker2d medium-expert D4RL | Normalized Return: 116.6 | 18 |
| Reinforcement Learning | MuJoCo Walker2d v2 | Average Return: 8,004 | 18 |
| Reinforcement Learning | MuJoCo Walker2d v5 | Mean Episodic Return: 5,222 | 17 |
| Locomotion | MuJoCo Walker2d Medium-Replay D4RL | Average Normalized Score: 128.6 | 16 |
| Continuous control locomotion | MuJoCo Walker2d v3 (train) | Final Return: 6,482.6 | 12 |
| Continuous Control | MuJoCo Walker2d (H=10) | Normalized Return: 14.9 | 10 |
| Locomotion | MuJoCo Walker2d Friction shift | Normalized Return: 40.8 | 8 |
| Locomotion | MuJoCo Walker2d Kinematic shift | Normalized Return: 56.4 | 8 |
| Locomotion | MuJoCo Walker2d Morphology shift | Normalized Return: 50.5 | 8 |
| Offline Reinforcement Learning | MuJoCo walker2d medium 1M | Final Score: 85.4 | 7 |
| Reinforcement Learning | MuJoCo Walker2d 1.5 density v1 (test) | Reward: 2,674 | 7 |
| Continuous Control | MuJoCo Walker2d 10-p v4 | Normalized Return: 102 | 6 |
| Continuous Control | MuJoCo Walker2d 4-p v4 | Normalized Return: 94.2 | 6 |
| Continuous Control | MuJoCo Walker2d v2 (train) | Mean Return: 5,278 | 6 |
| Reinforcement Learning | Sparse MuJoCo Walker2d v2 (test) | Max Return: 886.6 | 6 |
| Reinforcement Learning | MuJoCo Walker2d epsilon=0.05 (test) | Natural Return: 4,875 | 5 |
| Offline Inverse Reinforcement Learning | MuJoCo walker2d medium-exp | Average Reward: 5,383.98 | 5 |
| Offline Inverse Reinforcement Learning | MuJoCo walker2d (medium-replay) | Avg Reward: 5,383.98 | 5 |
| Offline Inverse Reinforcement Learning | MuJoCo walker2d medium | Avg Reward: 5,383.98 | 5 |
| Continuous Control | MuJoCo Walker2d 1M steps v3 | Average Return: 5,099 | 5 |
| Continuous Control | MuJoCo Walker2d v3 (500K steps) | Average Return: 4,034 | 5 |