| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Control Task | Lunar Lander (test) | Average Reward | 255.57 | 6 |
| MCTS Aggregation Strategy Evaluation | Lunar Lander | MRR | 83.33 | 6 |
| Lunar Landing | Lunar Lander modified (Laggy Pilot) | Success Rate | 100 | 6 |
| Lunar Landing | Lunar Lander modified per Yoneda et al. (Noisy Pilot) | Success Rate | 83 | 6 |
| Lunar Landing | Lunar Lander modified (Expert Pilot) | Success Rate | 100 | 6 |
| Classic Control | Lunar Lander OpenAI Gym (evaluation) | Mean Score | 163.5 | 5 |
| Lunar Lander Control | Lunar Lander | Success Rate | 0.46 | 4 |
| Multi-Objective Reinforcement Learning | Lunar Lander 4d | Hypervolume (HV) | 1.23 | 4 |
| Human-in-the-loop Control | Lunar Lander | Success Rate | 91.7 | 3 |