| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Control Task | Inverted Pendulum (test) | Average Reward: 996.35 | 6 |
| Inverted Pendulum Control | Inverted Pendulum (test) | ROA: 6,131 | 6 |
| Continuous Control | Inverted Pendulum | Normalized AUC: 0.97 | 3 |
| Sensory-motor control | Inverted Pendulum MuJoCo | Mean Best Reward: 829.66 | 2 |