Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on LunarLander (Average Episode Reward)
Loading...
283.56
Average Episode Reward
ESPL
-99.7424
-0.2312
99.28
198.7912
Nov 2, 2023
Apr 3, 2024
Sep 3, 2024
Feb 3, 2025
Jul 6, 2025
Dec 6, 2025
May 8, 2026
Average Episode Reward
Updated 23d ago
Evaluation Results
Method
Method
Links
Average Episode Reward
ESPL
2023.11
283.56
SAC
2023.11
276.92
TD3
2023.11
272.13
ACKTR
2023.11
271.53
PPO
2023.11
269.65
DDPG
2023.11
266.05
TRPO
2023.11
265.26
DSP
2023.11
261.36
DTSemNet
Nf (Number of features...
2026.05
252.5
Deep RL
Nf (Number of features...
2026.05
245
A2C
2023.11
238.51
DGT
Nf (Number of features...
2026.05
183.6
VIPER
Nf (Number of features...
2026.05
86.73
Regression
2023.11
56.08
ICCT
Nf (Number of features...
2026.05
-85
Feedback
Search any
task
Search any
task