| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| MuJoCo Ant v4 | TD7 | Average Return | 8,509 | 46 | 13d ago |
| DMControl 500k | | Spin Score | 979 | 42 | 2mo ago |
| MuJoCo Walker2d v4 | Opti-DICE | Normalized Performance | 13,060 | 39 | 13d ago |
| DMControl 100k | Sampled MuZero | DMControl: Finger Spin Score | 986.38 | 38 | 2mo ago |
| MuJoCo HalfCheetah v4 | TD7 | Average Return | 17,433 | 36 | 13d ago |
| LunarLanderContinuous offline trajectories v2 | MFRL | Episodic Cumulative Reward | 254.55 | 35 | 3mo ago |
| Mountain Car POMDP | AG-PFT-DPW | Mean Performance | 26.96 | 30 | 15d ago |
| MuJoCo Hopper v4 | tdBN | Normalized Performance | 3,592 | 28 | 3mo ago |
| MountainCar Source | | Success Rate | 100 | 27 | 3mo ago |
| MuJoCo Ant | TOP-TD3 | Average Reward | 6,336 | 26 | 2mo ago |
| MuJoCo HalfCheetah | TOP-TD3 | Average Reward | 13,144 | 25 | 1mo ago |
| Humanoid 17-Dof | SATR | Final Return | 13,860 | 21 | 3mo ago |
| MuJoCo Swimmer v4 | PPO | Total Reward | 362.4 | 19 | 13d ago |
| D4RL Hopper medium | OFQL | Normalized Return | 103.6 | 19 | 3mo ago |
| Hopper 3-Dof | SATR | Final Return | 2,735 | 18 | 3mo ago |
| MountainCar Drift II - Dynamics Shift | | Success Rate | 100 | 18 | 3mo ago |
| MountainCar Drift I - Dynamics Shift | | Success Rate | 100 | 18 | 3mo ago |
| MuJoCo Reacher v4 | DIDA | Normalized Performance | 103 | 18 | 3mo ago |
| MuJoCo Pusher v4 | AD-SAC | Normalized Performance | 1.36 | 18 | 3mo ago |
| MuJoCo HumanoidStandup v4 | VDPO | Normalized Performance | 1.29 | 18 | 3mo ago |
| MuJoCo Humanoid v4 | VDPO | Normalized Performance (Ret_nor) | 115 | 18 | 3mo ago |
| MuJoCo HalfCheetah v4 | AD-SAC | Normalized Performance | 107 | 18 | 3mo ago |
| DMC-GB video hard | SGQN | Cartpole Swingup Score | 54,443 | 18 | 1mo ago |
| MuJoCo Reacher | TRPO | Average Reward | 6.22 | 18 | 1mo ago |
| Walker2d v5 | DBC | Avg Return | 6,138.2 | 17 | 1mo ago |