| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Continuous Control | DeepMind Control Suite visual observations | Acrobot Swingup Score | 24,829 | 16 |
| Continuous Control | DeepMind Control Suite (DMC) | Cheetah Run | 873 | 15 |
| Continuous Control | DeepMind Control (DMC) Suite (1M steps) | IQM | 87.1 | 14 |
| Continuous Control | DeepMind Control Suite Cheetah Run | Reward | 836 | 13 |
| Continuous Control | DeepMind Control Suite Reacher Hard (test) | Reward | 975.35 | 12 |
| Continuous Control | DeepMind Control Suite Point Mass - Easy | Reward | 912.65 | 12 |
| Hopper Hop | DeepMind Control Suite (DMC) | Steps Required (k) | 153.5 | 12 |
| World model image prediction | DeepMind Control Suite Humanoid | MSE | 4.2077 | 12 |
| Physical state prediction | DeepMind Control Suite Humanoid Easy tasks (random policy) | MSE | 0.6535 | 12 |
| Physical state prediction | DeepMind Control Suite Reacher Easy tasks (random policy) | MSE | 0.0005 | 12 |
| Physical state prediction | DeepMind Control Suite Cheetah Easy tasks (random policy) | MSE | 0.1206 | 12 |
| Walker Run | DeepMind Control Suite (DMC) | Steps (k) | 512.6 | 10 |
| Cup Catch | DeepMind Control Suite (DMC) | Sample Efficiency (Steps) | 104,700 | 10 |
| World model image prediction | DeepMind Control Suite Acrobot | MSE | 0.1806 | 10 |
| World model image prediction | DeepMind Control Suite Cheetah | MSE | 0.1565 | 10 |
| Physical state prediction | DeepMind Control Suite Acrobot Easy tasks (random policy) | MSE | 0.0001 | 10 |
| Walker Walk | DeepMind Control Suite (in-distribution) | Average Return | 996.502 | 10 |
| World model image prediction | DeepMind Control Suite Walker | MSE | 1.0044 | 9 |
| World model image prediction | DeepMind Control Suite Hopper | MSE | 0.3149 | 9 |
| Continuous Control | DeepMind Control Suite Walker Run | Reward | 698.4 | 9 |
| Pixel-based Control | DeepMind Control Suite 500k environment steps | Cheetah Run Score | 803 | 9 |
| Pixel-based Control | DeepMind Control Suite 100k steps | Cheetah/Run Score | 512 | 9 |
| Walker Run | DeepMind control suite | Average Return | 856 | 8 |
| Cheetah Run | DeepMind control suite | Average Return | 922 | 8 |
| Hopper Hop | DeepMind control suite | Average Return | 511 | 8 |