Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on LunarLander (Average Episode Reward)

283.56Average Episode Reward

ESPL

-99.7424-0.231299.28198.7912Nov 2, 2023Apr 3, 2024Sep 3, 2024Feb 3, 2025Jul 6, 2025Dec 6, 2025May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2023.11
283.56
2023.11
276.92
2023.11
272.13
2023.11
271.53
2023.11
269.65
2023.11
266.05
2023.11
265.26
2023.11
261.36
2026.05
252.5
2026.05
245
2023.11
238.51
2026.05
183.6
2026.05
86.73
2023.11
56.08
2026.05
-85