| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Lunar Lander (test) | PPO | Average Reward255.57 | 6 | 4d ago | |
| Inverted Pendulum (test) | PPO | Average Reward996.35 | 6 | 4d ago | |
| Hopper (test) | TRPO | Average Reward864.12 | 6 | 4d ago | |
| CartPole (test) | TRPO | Average Reward494.09 | 6 | 4d ago | |
| OpenAI Gym LunarLander | POEM | T-statistic-1.8707 | 1 | 4d ago | |
| OpenAI Gym MountainCar | POEM | T-statistic-6.2431 | 1 | 4d ago | |
| OpenAI Gym CarRacing | POEM | T-statistic-6.3987 | 1 | 4d ago | |
| OpenAI Gym BipedalWalker | POEM | T-statistic-2.0642 | 1 | 4d ago | |
| MountainCar v0 (test) | - | - | 0 | 4d ago |