Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reinforcement Learning on BipedalWalker

314.24Average Episode Reward

TD3

-127.7704-13.0177101.735216.4877Nov 2, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.11
314.24-
2023.11
312.14-
2023.11
311.78-
2023.11
309.57-
2023.11
309.43-
2023.11
308.31-
2023.11
291.79-
2023.11
287.43-
2023.11
209.42-
2023.11
-110.77-
2023.11
-1,000
2023.11
-400,000
2023.11
-2,000