| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | Cheetah | Return934.76 | 24 | |
| Continuous Control | cheetah | Average Reward934.76 | 12 | |
| Actuator Inversion | Cheetah Ceval-in (eval-in) | AER319 | 8 | |
| Actuator Inversion | Cheetah (train) | AER319 | 8 | |
| Single-life task completion | Cheetah | Average Steps74,300 | 5 | |
| Meta-Reinforcement Learning | Cheetah vel-ood | FLOPs (k)0.53 | 3 | |
| Continuous Control | hardCheetah | Average Reward1.311 | 3 | |
| Latent space prediction | Cheetah | MSE0.0003 | 2 | |
| Reinforcement Learning | Cheetah | Zero-shot Reward10,941,130 | 1 |