| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Offline Reinforcement Learning | D4RL halfcheetah-medium-expert | Normalized Score110 | 169 | |
| Offline Reinforcement Learning | D4RL hopper-medium-expert | Normalized Score119.2 | 161 | |
| Offline Reinforcement Learning | D4RL walker2d-medium-expert | Normalized Score116.1 | 132 | |
| Offline Reinforcement Learning | D4RL Medium-Replay Hopper | Normalized Score110.6 | 109 | |
| Offline Reinforcement Learning | D4RL Medium HalfCheetah | Normalized Score84.3 | 105 | |
| Offline Reinforcement Learning | D4RL Medium Walker2d | Normalized Score106.4 | 104 | |
| Offline Reinforcement Learning | D4RL walker2d-random | Normalized Score510 | 101 | |
| Offline Reinforcement Learning | D4RL Medium-Replay HalfCheetah | Normalized Score95.8 | 97 | |
| Offline Reinforcement Learning | D4RL halfcheetah-random | Normalized Score45.4 | 94 | |
| Locomotion | D4RL walker2d-medium-expert | Normalized Score5,421.3 | 90 | |
| Offline Reinforcement Learning | D4RL hopper-random | Normalized Score53.6 | 86 | |
| walker2d locomotion | D4RL walker2d medium-replay | Normalized Score106.2 | 78 | |
| Offline Reinforcement Learning | D4RL antmaze-umaze (diverse) | Normalized Score93.5 | 74 | |
| Offline Reinforcement Learning | D4RL Gym walker2d (medium-replay) | Normalized Return109.7 | 73 | |
| Offline Reinforcement Learning | D4RL Medium Hopper | Normalized Score109.4 | 72 | |
| hopper locomotion | D4RL hopper medium-replay | Normalized Score105.12 | 71 | |
| Locomotion | D4RL walker2d-medium | Normalized Score88.1 | 70 | |
| Locomotion | D4RL halfcheetah-medium | Normalized Score63.5 | 70 | |
| Locomotion | D4RL halfcheetah-medium-replay | Normalized Score0.8874 | 68 | |
| Offline Reinforcement Learning | D4RL halfcheetah v2 (medium-replay) | Normalized Score76.9 | 68 | |
| Offline Reinforcement Learning | D4RL halfcheetah-expert v2 | Normalized Score113.7 | 66 | |
| Offline Reinforcement Learning | D4RL walker2d-expert v2 | Normalized Score116.3 | 66 | |
| Offline Reinforcement Learning | D4RL hopper-expert v2 | Normalized Score118.9 | 66 | |
| Offline Reinforcement Learning | D4RL AntMaze | AntMaze Umaze Return99.8 | 65 | |
| Offline Reinforcement Learning | D4RL Hopper-medium-replay v2 | Normalized Return110.7 | 64 |