| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | BipedalWalker | Average Episode Reward314.24 | 10 | |
| Continuous Control | BipedalWalker Nonmarkov v3 | AUC@T184.7 | 9 | |
| Continuous Control | BipedalWalker v3 | Episodic Cumulative Reward276.98 | 8 | |
| Reinforcement Learning | BipedalWalker v3 | Return180.58 | 2 | |
| Reinforcement Learning | bipedalwalker Sticky | AUC@T42,687,915.83 | 2 | |
| Reinforcement Learning | bipedalwalker Noisy | AUC@T32,301,685.83 | 2 | |
| Reinforcement Learning | bipedalwalker (Clean) | AUC@T9,392,950.11 | 2 | |
| Reinforcement Learning | BipedalWalker standard (test) | Length17 | 2 | |
| Interpretability Evaluation | BipedalWalker | Interpretability Score3.2 | 2 |