| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Offline Reinforcement Learning | D4RL halfcheetah-medium-expert | Normalized Score110 | 155 | |
| Offline Reinforcement Learning | D4RL hopper-medium-expert | Normalized Score119.2 | 153 | |
| Offline Reinforcement Learning | D4RL walker2d-medium-expert | Normalized Score115.7 | 124 | |
| Offline Reinforcement Learning | D4RL Medium-Replay Hopper | Normalized Score110.6 | 97 | |
| Offline Reinforcement Learning | D4RL Medium HalfCheetah | Normalized Score84.3 | 97 | |
| Offline Reinforcement Learning | D4RL Medium Walker2d | Normalized Score106.4 | 96 | |
| Offline Reinforcement Learning | D4RL walker2d-random | Normalized Score510 | 93 | |
| Offline Reinforcement Learning | D4RL halfcheetah-random | Normalized Score45.4 | 86 | |
| Offline Reinforcement Learning | D4RL Medium-Replay HalfCheetah | Normalized Score77.6 | 84 | |
| Offline Reinforcement Learning | D4RL hopper-random | Normalized Score53.6 | 78 | |
| Offline Reinforcement Learning | D4RL Gym walker2d (medium-replay) | Normalized Return109.7 | 68 | |
| hopper locomotion | D4RL hopper medium-replay | Normalized Score105.12 | 66 | |
| Offline Reinforcement Learning | D4RL AntMaze | AntMaze Umaze Return99.8 | 65 | |
| Offline Reinforcement Learning | D4RL Medium Hopper | Normalized Score109.4 | 64 | |
| Locomotion | D4RL walker2d-medium-expert | Normalized Score121.4 | 63 | |
| walker2d locomotion | D4RL walker2d medium-replay | Normalized Score106.2 | 63 | |
| Offline Reinforcement Learning | D4RL walker2d-medium-replay | Normalized Score99.3 | 62 | |
| Locomotion | D4RL halfcheetah-medium-replay | Normalized Score0.8874 | 61 | |
| Locomotion | D4RL walker2d-medium | Normalized Score88.1 | 60 | |
| Locomotion | D4RL halfcheetah-medium | Normalized Score63.5 | 60 | |
| Offline Reinforcement Learning | D4RL halfcheetah v2 (medium-replay) | Normalized Score76.9 | 58 | |
| Offline Reinforcement Learning | D4RL halfcheetah-expert v2 | Normalized Score106.8 | 56 | |
| Offline Reinforcement Learning | D4RL walker2d-expert v2 | Normalized Score115.9 | 56 | |
| Offline Reinforcement Learning | D4RL hopper-expert v2 | Normalized Score113 | 56 | |
| Offline Reinforcement Learning | D4RL Hopper-medium-replay v2 | Normalized Return107.4 | 54 |