| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Locomotion | D4RL Ant medium-offline | Normalized Score85.28 | 36 | |
| Offline Imitation Learning | D4RL Ant v2 (expert) | Normalized Score126.4 | 20 | |
| Offline Reinforcement Learning | D4RL ant medium v3 | Normalized Score98.9 | 7 | |
| Offline Reinforcement Learning | D4RL Ant Medium-Replay v2 | Normalized Score92.7 | 4 | |
| Offline Reinforcement Learning | D4RL Ant Medium-Expert v2 | Normalized Score136.2 | 4 | |
| Reinforcement Learning | D4RL Ant Med-Expert | D4RL Score125.47 | 2 | |
| Reinforcement Learning | D4RL Ant Med-Replay | D4RL Score89.39 | 2 | |
| Reinforcement Learning | D4RL Ant Medium | D4RL Score94.25 | 2 |