| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Continuous Control | DMControl 500k | Spin Score | 973 | 33 |
| Control | DMControl | DMControl: Ball in Cup Catch Score | 916.9 | 29 |
| Continuous Control | DMControl 100k | DMControl: Finger Spin Score | 986.38 | 29 |
| Visual Reinforcement Learning | DMControl Ball in cup, Catch | Episode Return | 960 | 16 |
| Visual Reinforcement Learning | DMControl Finger, Spin | Episode Return | 976 | 16 |
| Visual Reinforcement Learning | DMControl Walker Walk | Episode Return | 802 | 16 |
| Visual Reinforcement Learning | DMControl Cheetah Run | Episode Return | 504 | 16 |
| Visual Reinforcement Learning | DMControl Reacher Easy | Episode Return | 969 | 16 |
| Visual Reinforcement Learning | DMControl Cartpole, Swingup | Episode Return | 872 | 16 |
| Reinforcement Learning | DMControl | Hopper/Hop Error | 0.024 | 13 |
| Finger, spin | DMControl Novel view (test) | Reward | 917.21 | 12 |
| Cup, catch | DMControl Novel view (test) | Reward | 973.6 | 12 |
| Offline Reinforcement Learning | DMControl walker-walk (expert) | Normalized Score | 97.97 | 12 |
| Offline Reinforcement Learning | DMControl cheetah-run (expert) | Normalized Score | 96.28 | 12 |
| Continuous Control | DMControl Novel view | Episode Reward | 770.56 | 8 |
| Continuous Control | DMControl | Point Mass Easy | 885 | 7 |
| Zero-shot Continuous Control | DMControl-GB average generalization | Cartpole Swingup Score | 808 | 7 |
| Continuous Control | DMControl GB (train) | Cartpole Swingup Score | 872 | 7 |
| Cheetah Run | DMControl-GB color-easy (test) | Average Episode Return | 346 | 7 |
| Cartpole Swingup | DMControl-GB color-easy (test) | Average Episode Return | 856 | 7 |
| Reinforcement Learning | DMControl Ball in cup, catch (500k steps) | Total Reward | 984 | 7 |
| Reinforcement Learning | DMControl Cheetah, run (500k steps) | Total Reward | 731 | 7 |
| Reinforcement Learning | DMControl Cartpole swingup 500k steps | Total Reward | 863 | 7 |
| Reinforcement Learning | DMControl Finger, spin (500k steps) | Total Reward | 947 | 7 |
| Reinforcement Learning | DMControl Ball in cup, catch 100k steps | Total Reward | 859 | 7 |