Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on LunarLanderContinuous v2

533.6Mean Reward

Linear

-11.9424129.6888271.32412.9512Apr 30, 2022Oct 18, 2022Apr 7, 2023Sep 26, 2023Mar 15, 2024Sep 2, 2024Feb 21, 2025
Updated 17d ago

Evaluation Results

MethodLinks
2024.06
533.6130.04
2024.06
531.05115.63
2024.06
513.44129.48
2024.06
510.13123.56
2024.06
492.67126.01
2024.06
479.01124.67
2024.06
475.43125.81
2024.06
473.77127.78
2024.06
458.1131.61
2024.06
445.52143.65
2024.06
403.56143.06
2024.06
337.2298.64
2022.04
290.2924.4
2022.04
286.8721.65
2022.04
283.0516.28
2024.06
280.3617.19
2024.06
280.0917.16
2024.06
280.0817.14
2024.06
279.5117.67
2024.06
279.2418.12
2024.06
279.1918.42
2024.06
278.9619.22
2025.02
272.7-
2025.02
268.3-
2025.02
260.8-
2024.06
260.08238.84
2024.06
258.8431.52
2025.02
257.8-
2025.02
251.7-
2025.02
246.2-
2025.02
246.1-
2025.02
242.6-
2025.02
227.1-
2025.02
225.4-
2025.02
225.1-
2022.04
221.95133.8
2022.04
214.5593.79
2022.04
213.7599.67
2024.06
188.5121.78
2024.06
154.4843.2
2024.06
145.0542.29
2024.06
143.46112.2
2024.06
133.5138.96
2024.06
133.266.54
2024.06
128.81276.58
2024.06
127.2981.82
2022.04
114.97113.48
2024.06
109.7279.65
2024.06
106.87133.97
2024.06
101.33262.09
2024.06
96.5113.1
2024.06
91.77105.02
2024.06
75.52111.54
2024.06
73.170.24
2022.04
73.08163.33
2024.06
70.04120.93
2024.06
51.1599.21
2024.06
45.12140.02
2024.06
9.0498.49