Reinforcement Learning on LunarLander (LL) (test)
[Chart: Average Undiscounted Reward over time. Best result shown: MLP, 241 (Mar 11, 2026).]
Updated 1mo ago
Evaluation Results
Method   Configuration               Date      Average Undiscounted Reward
MLP      Number of random trial...   2026.03   241
SYMPOL   Number of random trial...   2026.03   57
SDT      Number of random trial...   2026.03   -124
SA-DT    Depth (d)=8, Number of...   2026.03   -150
SA-DT    Depth (d)=5, Number of...   2026.03   -197
D-SDT    Number of random trial...   2026.03   -221