Lunar Lander on Lunar Lander (train)
[Chart: Average Reward by date. Top entry: MLES, 1.09 (Aug 7, 2025). Reported metrics: Average Reward, Standard Error of the Mean (SEM), Best Run Reward.]
Evaluation Results
Method          Date     Average Reward  Standard Error of the Mean (SEM)  Best Run Reward
MLES            2025.08  1.09            0.005                             1.098
EoH             2025.08  1.053           0.021                             1.085
PPO             2025.08  1.032           0.026                             1.076
DQN             2025.08  1.017           0.022                             1.066
Initial policy  2025.08  0.629           -                                 -