Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on LunarLanderContinuous v2

533.6Mean Reward

Linear

-11.9424129.6888271.32412.9512Apr 30, 2022Dec 30, 2022Sep 1, 2023May 3, 2024Jan 3, 2025Sep 5, 2025May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2024.06
533.6130.04
2024.06
531.05115.63
2024.06
513.44129.48
2024.06
510.13123.56
2024.06
492.67126.01
2024.06
479.01124.67
2024.06
475.43125.81
2024.06
473.77127.78
2024.06
458.1131.61
2024.06
445.52143.65
2024.06
403.56143.06
2024.06
337.2298.64
2022.04
290.2924.4
2022.04
286.8721.65
2022.04
283.0516.28
2026.05
282.49-
2024.06
280.3617.19
2024.06
280.0917.16
2024.06
280.0817.14
2024.06
279.5117.67
2024.06
279.2418.12
2024.06
279.1918.42
2024.06
278.9619.22
2026.05
277.24-
2026.05
276.12-
2025.02
272.7-
2025.02
268.3-
2026.05
267.9-
2025.02
260.8-
2024.06
260.08238.84
2024.06
258.8431.52
2025.02
257.8-
2026.05
255.57-
2025.02
251.7-
2025.02
246.2-
2025.02
246.1-
2025.02
242.6-
2025.02
227.1-
2025.02
225.4-
2025.02
225.1-
2022.04
221.95133.8
2022.04
214.5593.79
2022.04
213.7599.67
2024.06
188.5121.78
2024.06
154.4843.2
2024.06
145.0542.29
2024.06
143.46112.2
2024.06
133.5138.96
2024.06
133.266.54
2026.05
131.92-
2024.06
128.81276.58
2024.06
127.2981.82
2022.04
114.97113.48
2024.06
109.7279.65
2024.06
106.87133.97
2024.06
101.33262.09
2024.06
96.5113.1
2024.06
91.77105.02
2024.06
75.52111.54
2024.06
73.170.24
2022.04
73.08163.33
2024.06
70.04120.93
2024.06
51.1599.21
2024.06
45.12140.02
2024.06
9.0498.49