Reinforcement Learning on LunarLander (LL) (test)

Average Undiscounted Reward: 241

MLP

[Chart: Average Undiscounted Reward, Mar 11, 2026]
Updated 1mo ago

Evaluation Results

Date      Average Undiscounted Reward
2026.03   241
2026.03   57
2026.03   -124
2026.03   -150
2026.03   -197
2026.03   -221
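
The metric reported above, average undiscounted reward, is usually read as the mean over evaluation episodes of the plain sum of per-step rewards (no discount factor applied). The sketch below illustrates that reading only; the function name `average_undiscounted_reward` and the sample numbers are hypothetical and are not taken from this benchmark's evaluation code.

```python
from statistics import mean


def average_undiscounted_reward(episodes):
    """Mean over episodes of the undiscounted (gamma = 1) sum of step rewards.

    `episodes` is a list of episodes, each a list of per-step rewards.
    This is an assumed definition of the metric, not the benchmark's own code.
    """
    if not episodes:
        raise ValueError("need at least one episode")
    # Undiscounted return of one episode: add up every step reward as-is.
    returns = [sum(rewards) for rewards in episodes]
    return mean(returns)


# Hypothetical per-step rewards for three short episodes
# (illustrative values, not real LunarLander rollouts).
example_episodes = [
    [1.0, -0.5, 100.0],
    [-0.3, -0.3, -100.0],
    [0.5, 0.5, 2.0],
]
result = average_undiscounted_reward(example_episodes)
print(round(result, 3))
```

In practice the per-step rewards would come from rolling out a policy in the environment for a fixed number of evaluation episodes; how many episodes and which seeds this leaderboard uses is not stated on the page.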