| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | MuJoCo Walker | Average Return6,115 | 14 | |
| Continuous Control | MuJoCo Walker fixed random adversary L=0.1 | Avg Performance5,278 | 12 | |
| Reinforcement Learning | MuJoCo Walker (test) | Average Performance4,888 | 12 | |
| Continuous Control | MuJoCo Walker logarithmic adversary v1 | Average Performance4,931 | 12 | |
| Inverse Reinforcement Learning | MuJoCo Walker (test) | Average Performance5,423 | 4 | |
| Locomotion | MuJoCo Walker (t = T) | Average Return755 | 3 | |
| Locomotion | MuJoCo Walker t = 3T/4 | Average Return631.6 | 3 | |
| Locomotion | MuJoCo Walker t = 2T/3 (shift) | Average Return411.6 | 3 | |
| Locomotion | MuJoCo Walker t = T/2 (shift) | Average Return323.8 | 3 |