| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Time Series Reconstruction | MuJoCo (test) | MSE0.285 | 51 | |
| Offline Reinforcement Learning | MujoCo halfcheetah | Normalized Return60.8 | 33 | |
| Offline Reinforcement Learning | MuJoCo hopper D4RL (medium-replay) | Normalized Return101.6 | 26 | |
| Continuous Control | MuJoCo Ant v4 | Normalized Return136 | 24 | |
| Reinforcement Learning | MuJoCo HumanoidStandup | Average Performance130,892 | 24 | |
| 3D Dynamics Prediction | MuJoCo Fall-and-rebound scenario | Translation Error (m)0.0048 | 20 | |
| Offline Reinforcement Learning | MuJoCo halfcheetah-medium-replay D4RL | Normalized Return54.1 | 20 | |
| Offline Reinforcement Learning | MuJoCo walker2d-medium D4RL | Normalized Return88.2 | 20 | |
| Offline Reinforcement Learning | MuJoCo halfcheetah-medium D4RL | Normalized Return65.6 | 20 | |
| Continuous Control | MuJoCo Reacher v4 | Normalized Performance103 | 18 | |
| Continuous Control | MuJoCo Pusher v4 | Normalized Performance1.36 | 18 | |
| Continuous Control | MuJoCo HumanoidStandup v4 | Normalized Performance1.29 | 18 | |
| Continuous Control | MuJoCo Hopper v4 | Normalized Performance1.25 | 18 | |
| Offline Reinforcement Learning | MuJoCo halfcheetah-medium-expert D4RL | Normalized Return101.1 | 18 | |
| Reinforcement Learning | MuJoCo Half-Cheetah | Average Return13,300 | 18 | |
| HalfCheetah | Mujoco | Reward9.48 | 16 | |
| Ant | MuJoCo | Recovery Time (%)5.9 | 16 | |
| Reinforcement Learning | MuJoCo Ant | Average Return7,889.1 | 14 | |
| Reinforcement Learning | MuJoCo Hopper | Average Return3,876 | 14 | |
| Offline Reinforcement Learning | MuJoCo hopper-medium D4RL | Normalized Return96.9 | 13 | |
| Continuous Control | MuJoCo Hopper fixed random adversary L=0.1 | Average Performance2,365 | 12 | |
| Reinforcement Learning | MuJoCo Hopper (test) | Average Reward1,946 | 12 | |
| Reinforcement Learning | MuJoCo HalfCheetah (test) | Avg Performance8,174 | 12 | |
| Continuous Control | MuJoCo v2 (test) | Ant Score1.78 | 12 | |
| Continuous Control | MuJoCo Reacher | Average Reward-3.85 | 12 |