| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | Hopper v5 | Average Return3,732.5 | 101 | |
| Offline Reinforcement Learning | hopper medium | Normalized Score3,729 | 58 | |
| Offline Reinforcement Learning | hopper medium-replay | Normalized Score113 | 44 | |
| Offline Reinforcement Learning | Hopper D4RL v2 (offline) | Average Score100.8 | 32 | |
| Offline Reinforcement Learning | Hopper Medium JointNoise Shift | Average Return109.803 | 27 | |
| Offline Reinforcement Learning | Hopper Medium BodyMass Shift | Average Return82.786 | 27 | |
| Offline Reinforcement Learning | 1T10S Hopper (Medium-Expert) | Score111.587 | 26 | |
| Offline Reinforcement Learning | 1T10S Hopper (Medium-Replay) | Score98.988 | 26 | |
| Offline Reinforcement Learning | Hopper 1T10S (Medium) | Score101.244 | 26 | |
| Reinforcement Learning | Hopper v3 | Average Final Return4,104 | 26 | |
| Offline Reinforcement Learning | Hopper medium-expert | Normalized Score111.6 | 24 | |
| Offline Reinforcement Learning | hopper Mixed Dataset | Normalized Reward108 | 24 | |
| Locomotion | Hopper IID (test) | Mean Episode Reward1,859 | 24 | |
| Locomotion Control | Hopper sigma 0.3 (test) | Episode Reward1,368 | 24 | |
| Locomotion | Hopper | Convergence (%)100 | 20 | |
| Offline Reinforcement Learning | Hopper expert | Normalized Score112.8 | 19 | |
| Offline Reinforcement Learning | Hopper Medium-Expert BodyMass Shift | Average Return77.279 | 18 | |
| Offline Reinforcement Learning | Hopper Medium-Replay JointNoise Shift | Average Return93.704 | 18 | |
| Offline Reinforcement Learning | Hopper Medium-Expert 1T10S | Average Return109.803 | 18 | |
| Offline Reinforcement Learning | Hopper Medium-Replay 1T10S | Average Return93.704 | 18 | |
| Offline Reinforcement Learning | Hopper Medium 1T10S | Average Return78.325 | 18 | |
| Continuous Control | Hopper 3-Dof | Final Return2,735 | 18 | |
| Locomotion Control | Hopper sigma 0.7 (test) | Episode Reward443 | 18 | |
| Locomotion Control | Hopper sigma 0.5 (test) | Episode Reward729 | 18 | |
| Locomotion Control | Hopper sigma 0.1 (test) | Episode Reward1,859 | 18 |