| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Reinforcement Learning | Ant v5 | Average Return | 6,633.8 | 49 |
| Continuous Robot Control | Ant v3 (test) | Reward | 5,648 | 48 |
| Locomotion | Ant IID (test) | Mean Episode Reward | 2,240 | 24 |
| Locomotion Control | Ant sigma 0.5 (test) | Episode Reward | 974 | 24 |
| Locomotion Control | Ant sigma 0.3 (test) | Episode Reward | 1,723 | 24 |
| Locomotion Control | Ant sigma 0.1 (test) | Episode Reward | 2,240 | 24 |
| Locomotion Control | Ant sigma 0.7 (test) | Episode Reward | 306 | 18 |
| Offline Reinforcement Learning | Ant kinematic shifts | Score | 120 | 16 |
| Offline Reinforcement Learning | Ant Medium D4RL | Normalized Score | 96.4 | 14 |
| Offline Policy Adaptation | ant medium-expert | Normalized Score | 79.3 | 14 |
| Offline Policy Adaptation | ant medium-replay | Normalized Score | 76.2 | 14 |
| Offline Policy Adaptation | ant medium | Normalized Score | 77.2 | 14 |
| Continuous Control | Ant v5 | Normalized Mean Return | 1.14 | 12 |
| Reinforcement Learning | Ant fixed linear adversary | Average Performance | 8,069 | 12 |
| Worst-case time-constrained reinforcement learning | Ant MuJoCo (test) | Normalized Worst-Case Reward | 1.66 | 12 |
| Robust Reinforcement Learning | Ant MuJoCo (fixed exponential adversary) | Average Performance | 7,724 | 12 |
| Continuous Control | Ant MuJoCo (test) | Worst-case Performance | 7,534 | 12 |
| Robot Locomotion | Ant v1 (test) | Performance Score | 2,370.93 | 12 |
| Imitation Learning | Ant one-shot v2 | Normalized Score | 29.7 | 11 |
| Imitation Learning from Observation | Ant v4 | AER | 5,904.2506 | 8 |
| Offline Reinforcement Learning | Ant expert | Normalized Score | 23.1 | 7 |
| Offline Reinforcement Learning | Ant random | Normalized Score | 20.3 | 7 |
| Continuous Control | Ant v3 | Average Return | 5,115 | 7 |
| Offline Policy Adaptation | ant-medium morphology shift target: expert D4RL | Normalized Avg Score | 74.1 | 7 |
| Offline Policy Adaptation | ant medium gravity shift target D4RL | Average Score | 45.1 | 7 |