| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Navigation | D4RL AntMaze umaze v2 | Initial D4RL Score137.4 | 12 | |
| Offline Reinforcement Learning | D4RL AntMaze medium-diverse v2 (test) | Normalized Score80.2 | 12 | |
| Offline Reinforcement Learning | D4RL AntMaze umaze-diverse v2 (test) | Score (Normalized)84 | 12 | |
| Navigation | D4RL AntMaze large-diverse v2 | Normalized D4RL Score58.4 | 10 | |
| Navigation | D4RL AntMaze large-play v2 | Normalized D4RL Score55.83 | 10 | |
| Navigation | D4RL AntMaze medium-diverse v2 | Normalized D4RL Score85.5 | 10 | |
| Navigation | D4RL AntMaze medium-play v2 | Normalized D4RL Score81 | 10 | |
| Offline Reinforcement Learning | D4RL AntMaze umaze, umaze-d, med-p, med-d, large-p, large-d v0 | umaze Return96.8 | 10 | |
| Offline Reinforcement Learning | D4RL AntMaze v0 (test) | Umaze Score87.5 | 10 | |
| Offline Reinforcement Learning | D4RL AntMaze fixed, play, diverse | AntMaze UMaze (Fixed) Score85 | 10 | |
| Offline Reinforcement Learning | D4RL Antmaze umaze, medium, large v0 | AntMaze UMaze v0 Score92.3 | 9 | |
| Offline Reinforcement Learning | D4RL Antmaze umaze, medium, large v2 | UMaze Score0.967 | 8 | |
| Offline Reinforcement Learning | D4RL Antmaze-Diverse | AntMaze-Medium9,240 | 8 | |
| Offline Reinforcement Learning | D4RL Antmaze (Play) | AntMaze Medium Score88.8 | 8 | |
| Offline multitask Reinforcement Learning | D4RL Antmaze large-play | Average Episodic Return306 | 7 | |
| Offline multitask Reinforcement Learning | D4RL Antmaze large-diverse | Avg Episodic Return359 | 7 | |
| Offline multitask Reinforcement Learning | D4RL Antmaze medium-play | Average Episodic Return624 | 7 | |
| Offline multitask Reinforcement Learning | D4RL Antmaze medium-diverse | Episodic Return631 | 7 | |
| Offline multitask Reinforcement Learning | D4RL Antmaze umaze-diverse | Average Episodic Return577 | 7 | |
| Offline multitask Reinforcement Learning | D4RL Antmaze umaze | Average Episodic Return593 | 7 | |
| Offline Goal-conditioned Reinforcement Learning | D4RL AntMaze large-diverse v2 | Normalized Score54.1 | 7 | |
| Offline Goal-conditioned Reinforcement Learning | D4RL AntMaze large-play v2 | Normalized Score48.6 | 7 | |
| Offline Goal-conditioned Reinforcement Learning | D4RL AntMaze medium-diverse v2 | Normalized Score72.5 | 7 | |
| Offline Goal-conditioned Reinforcement Learning | D4RL AntMaze medium-play v2 | Normalized Score72.6 | 7 | |
| Offline Goal-conditioned Reinforcement Learning | D4RL AntMaze umaze-diverse v2 | Normalized Score0.776 | 7 |