Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on BipedalWalker

325.35Average Episode Reward

DTSemNet

-128.2148-10.4624107.29225.0424Nov 2, 2023Apr 3, 2024Sep 3, 2024Feb 3, 2025Jul 6, 2025Dec 6, 2025May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2026.05
325.35-
2026.05
315.3-
2026.05
314.98-
2023.11
314.24-
2023.11
312.14-
2023.11
311.78-
2025.02
310.2-
2023.11
309.57-
2023.11
309.43-
2023.11
308.31-
2025.02
307.3-
2026.05
301.34-
2023.11
291.79-
2023.11
287.43-
2025.02
286.2-
2025.02
280.9-
2025.02
280.4-
2025.02
264.4-
2025.02
262.6-
2026.05
244.5-
2025.02
241-
2025.02
235.1-
2023.11
209.42-
2025.02
94.2-
2026.05
78.33-
2023.11
-110.77-
2023.11
-1,000
2023.11
-400,000
2023.11
-2,000