| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reinforcement Learning | LunarLanderContinuous v2 | Mean Reward533.6 | 59 | |
| Continuous Control | LunarLanderContinuous offline trajectories v2 | Episodic Cumulative Reward254.55 | 35 | |
| Surrogate Modeling | LunarLanderContinuous v3 (val) | Fidelity (%)96.84 | 4 |