Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reinforcement Learning on BipedalWalker
Loading...
314.24
Average Episode Reward
TD3
-127.7704
-13.0177
101.735
216.4877
Nov 2, 2023
Average Episode Reward
Total Episodes
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Episode Reward
Total Episodes
TD3
2023.11
314.24
-
TRPO
2023.11
312.14
-
DSP
2023.11
311.78
-
ACKTR
2023.11
309.57
-
ESPL
2023.11
309.43
-
SAC
2023.11
308.31
-
A2C
2023.11
291.79
-
PPO
2023.11
287.43
-
DDPG
2023.11
209.42
-
Regression
2023.11
-110.77
-
Regression
2023.11
-
1,000
DSP
2023.11
-
400,000
ESPL
Selection=Best policy...
2023.11
-
2,000
Feedback
Search any
task
Search any
task