| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Continuous Control | MuJoCo Humanoid v4 | Normalized Performance (Ret_nor)115 | 18 | |
| Reinforcement Learning | MuJoCo Humanoid v2 | Average Return10,490 | 18 | |
| Continuous Control | MuJoCo Humanoid v2 (train) | Mean Return6,242 | 6 | |
| Reinforcement Learning | MuJoCo Humanoid v2 (test) | Max Avg Return9,080.54 | 6 | |
| Continuous Control | MuJoCo Humanoid v5 (test) | Average Return5,701.2 | 4 | |
| Meta-Reinforcement Learning | MuJoCo Humanoid Body variation (test) | CVaR 0.05 Return1,365 | 2 | |
| Meta-Reinforcement Learning | MuJoCo Humanoid Mass variation (test) | CVaR 0.05 Return1,378 | 2 | |
| Meta-Reinforcement Learning | MuJoCo Humanoid Velocity variation (test) | CVaR 0.05 Return833 | 2 | |
| Reinforcement Learning | MuJoCo Humanoid | Average Return10,249 | 2 | |
| Continuous control locomotion | MuJoCo Humanoid v3 (train) | Avg Performance (1M Steps)665 | 2 |