| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Offline Reinforcement Learning | antmaze medium-play | Score: 85.6 | 44 |
| Offline Reinforcement Learning | Antmaze Medium play offline (target domain) | Target Domain Score (Normalized): 398.8 | 42 |
| Offline Reinforcement Learning | antmaze medium-diverse | Score: 85 | 27 |
| Offline Reinforcement Learning | Antmaze umaze | Average Return: 96.7 | 24 |
| Goal-conditioned Reinforcement Learning | antmaze stitch large | Success Rate: 88 | 23 |
| Goal-conditioned Reinforcement Learning | antmaze stitch medium | Success Rate: 69 | 23 |
| Offline Reinforcement Learning | antmaze large-play | Score: 78.2 | 18 |
| Navigation | AntMaze | Success Rate: 9,110 | 16 |
| Navigation | AntMaze Small | Success Rate: 9,510 | 16 |
| Offline Reinforcement Learning | AntMaze large-diverse (l-d) | Normalized Score: 81.8 | 15 |
| Offline Reinforcement Learning | Antmaze umaze-diverse | Average Return: 90.7 | 15 |
| Offline Reinforcement Learning | AntMaze umaze-diverse | Normalized Average Return: 84 | 14 |
| Offline Goal-Conditioned Reinforcement Learning | antmaze medium-navigate v0 | Success Rate: 96 | 14 |
| Offline Reinforcement Learning | AntMaze Medium-Diverse v2 | Average Score: 6.6 | 14 |
| Offline Reinforcement Learning | AntMaze Medium-Play v2 | Average Score: 89.5 | 14 |
| Offline Reinforcement Learning | antmaze medium-play v0 | Avg Normalized Score: 8,830 | 14 |
| Offline Reinforcement Learning | antmaze umaze-diverse v0 | Avg Normalized Score: 88.5 | 14 |
| Offline Reinforcement Learning | antmaze umaze v0 | Averaged Normalized Score: 98.6 | 14 |
| Offline Reinforcement Learning | AntMaze Ultra-Diverse | Avg Normalized Score: 5,460 | 14 |
| Offline Reinforcement Learning | AntMaze-Ultra-Play | Avg Normalized Score: 56.6 | 14 |
| Offline Goal-Conditioned Reinforcement Learning | antmaze teleport-stitch v0 | Success Rate: 49 | 13 |
| Offline Goal-Conditioned Reinforcement Learning | antmaze giant-stitch v0 | Success Rate: 32 | 13 |
| Offline Goal-Conditioned Reinforcement Learning | antmaze giant-navigate v0 | Success Rate: 74 | 13 |
| Reinforcement Learning | AntMaze large-play D4RL | Average Episodic Return: 533 | 12 |
| Reinforcement Learning | AntMaze umaze D4RL | Average Episodic Return: 623 | 12 |