| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Offline Reinforcement Learning | walker2d medium-replay | Normalized Score | 101.3 | 61 |
| Offline Reinforcement Learning | walker2d medium | Normalized Score | 1,248 | 61 |
| Reinforcement Learning | Walker2D v5 | Average Return | 6,335.5 | 45 |
| Offline Reinforcement Learning | Walker2d medium-expert | Normalized Score | 121.4 | 42 |
| Offline Reinforcement Learning | Walker2d D4RL v2 (offline) | Return | 86.7 | 32 |
| Offline Reinforcement Learning | Walker2D Medium BodyMass Shift | Average Return | 94.578 | 27 |
| Offline Reinforcement Learning | 1T10S Walker2D (Medium-Expert) | Score | 113.069 | 26 |
| Offline Reinforcement Learning | 1T10S Walker2D (Medium-Replay) | Performance Score | 71.67 | 26 |
| Offline Reinforcement Learning | 1T10S Walker2D (Medium) | Score | 80.693 | 26 |
| Reinforcement Learning | Walker2d v3 | Average Final Return | 6,701 | 26 |
| Offline Reinforcement Learning | walker2d Mixed Dataset | Normalized Reward | 64.4 | 24 |
| Offline Reinforcement Learning | Walker2d | Clean Score | 3,718 | 21 |
| Locomotion | Walker2d Medium-Expert v2 | Average Normalized Score | 115.9 | 19 |
| Locomotion | Walker2d Medium-Replay v2 | Average Normalized Score | 97.4 | 19 |
| Locomotion | Walker2d Medium v2 | Average Normalized Score | 92.5 | 19 |
| Per time-step regression | Walker2D | Squared Error | 0.617 | 19 |
| Offline Reinforcement Learning | Walker2D Medium-Replay BodyMass Shift | Average Return | 87.491 | 18 |
| Offline Reinforcement Learning | Walker2D Medium-Expert 1T10S | Average Return | 118.564 | 18 |
| Offline Reinforcement Learning | Walker2D Medium-Replay 1T10S | Average Return | 87.491 | 18 |
| Offline Reinforcement Learning | Walker2D Medium 1T10S | Average Return | 84.582 | 18 |
| Reinforcement Learning | Walker2d v4 | Avg Return | 39,641,353 | 18 |
| Offline Reinforcement Learning | walker2d medium v2 | Normalized Score | 92.7 | 18 |
| Continuous Control | Walker2d v5 | Avg Return | 6,138.2 | 17 |
| Offline Reinforcement Learning | Walker2d kinematic shifts | Score | 106 | 16 |
| Offline Reinforcement Learning | Walker2d | Average Return | 2,278.9 | 16 |