Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LunarLanderContinuous

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningLunarLanderContinuous v2
Mean Reward533.6
59
Continuous ControlLunarLanderContinuous offline trajectories v2
Episodic Cumulative Reward254.55
35
Surrogate ModelingLunarLanderContinuous v3 (val)
Fidelity (%)96.84
4
Showing 3 of 3 rows