| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Offline Reinforcement Learning | walker2d medium | Normalized Score | 1,248 | 51 |
| Offline Reinforcement Learning | walker2d medium-replay | Normalized Score | 99.6 | 50 |
| Reinforcement Learning | Walker2D v5 | Average Return | 6,335.5 | 43 |
| Offline Reinforcement Learning | Walker2d D4RL v2 (offline) | Return | 86.7 | 32 |
| Offline Reinforcement Learning | Walker2d medium-expert | Normalized Score | 114.4 | 31 |
| Offline Reinforcement Learning | walker2d Mixed Dataset | Normalized Reward | 64.4 | 24 |
| Offline Reinforcement Learning | Walker2d | Clean Score | 3,718 | 21 |
| Per time-step regression | Walker2D | Squared Error | 0.617 | 19 |
| Offline Reinforcement Learning | Walker2d kinematic shifts | Score | 106 | 16 |
| Offline Reinforcement Learning | Walker2d | Average Return | 2,278.9 | 16 |
| Offline Policy Adaptation | walker2d medium-expert | Normalized Score | 62.1 | 14 |
| Offline Policy Adaptation | walker2d medium-replay | Normalized Score | 63.1 | 14 |
| Offline Policy Adaptation | walker2d medium | Normalized Score | 56.7 | 14 |
| Offline Reinforcement Learning | Walker2d random | Normalized Score | 21.5 | 14 |
| Reinforcement Learning | Walker2d v4 | Avg Return | 39,641,353 | 13 |
| Continuous Control | Walker2d 6-Dof | Final Return | 5,143 | 12 |
| Offline Reinforcement Learning | Walker2d Expert | Episodic Return | 211.31 | 12 |
| Imitation Learning | Walker2d one-shot v2 | Normalized Score | 70 | 11 |
| Offline Reinforcement Learning | walker2d medium v2 | Average Score | 2,448 | 9 |
| Continuous Control | walker2d | Avg Reward | 5,111 | 9 |
| Offline-to-online Reinforcement Learning | walker2d | Regret | 544.4 | 8 |
| Offline Reinforcement Learning | Walker2d v2 (offline) | Observation Value | 0 | 8 |
| Reinforcement Learning | Walker2d gravity v2 | Average Return | 5,866 | 8 |
| Offline Reinforcement Learning | Walker2D Medium-Replay Noise 0 | Normalized Return | 90.67 | 7 |
| Offline Reinforcement Learning | Walker2D Medium-Replay Noise 7 | Normalized Return | 74.04 | 7 |