| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | MuJoCo Hopper v2 | Average Return4,408 | 18 | |
| Continuous Control | MuJoCo Hopper logarithmic adversary v1 | Average Performance Score2,577 | 12 | |
| Continuous Control | MuJoCo Hopper H=20 | Normalized Return33.3 | 10 | |
| Continuous Control | MuJoCo Hopper H=10 | Normalized Return13.2 | 10 | |
| Offline Reinforcement Learning | MuJoCo Hopper Medium-Replay v2 | Avg Normalized Score100.02 | 8 | |
| Offline Reinforcement Learning | MuJoCo Hopper Medium-Expert v2 | Avg Normalized Score107 | 7 | |
| Offline Reinforcement Learning | MuJoCo Hopper Medium v2 | Averaged Normalized Score102 | 7 | |
| Continuous Control | MuJoCo Hopper 2-p v4 | Normalized Return106 | 6 | |
| Continuous Control | MuJoCo Hopper 4-p v4 | Normalized Return99 | 6 | |
| Continuous Control | MuJoCo Hopper v2 (train) | Mean Return3,713 | 6 | |
| Reinforcement Learning | MuJoCo Hopper epsilon=0.075 (test) | Natural Return3,684 | 5 | |
| Offline Inverse Reinforcement Learning | MuJoCo hopper (medium-exp) | Average Reward3,512.09 | 5 | |
| Offline Inverse Reinforcement Learning | MuJoCo hopper (medium-replay) | Average Reward3,512.09 | 5 | |
| Offline Inverse Reinforcement Learning | MuJoCo hopper medium | Average Reward3,512.09 | 5 | |
| Continuous Control | MuJoCo Hopper v3 (1M steps) | Average Return3,687 | 5 | |
| Continuous Control | MuJoCo Hopper v3 (500K steps) | Average Return3,548 | 5 | |
| Policy Optimization | MuJoCo Hopper H=40 | Return71 | 5 | |
| Continuous Control | MuJoCo Hopper (H=40) | Normalized Return71 | 5 | |
| Dynamics Model Prediction | MuJoCo Hopper medium-replay v2 (test) | RMSE0.408 | 4 | |
| Dynamics Model Prediction | MuJoCo Hopper expert v2 (test) | RMSE0.322 | 4 | |
| Dynamics Model Prediction | MuJoCo Hopper medium v2 (train) | RMSE0.034 | 4 | |
| Policy Gradient | MuJoCo Hopper (final 20 iterations) | Average Return185.9 | 3 | |
| Continuous control locomotion | MuJoCo Hopper v3 (train) | Avg Performance (1M Steps)2,544 | 2 | |
| Reinforcement Learning | MuJoCo Hopper v4 | Metric- | 0 |