Reinforcement Learning on Inverted Double Pendulum
[Chart: Avg Episode Reward. Best result: SAC, 9,359.92 (Nov 2, 2023).]
Evaluation Results
| Method | Date | Avg Episode Reward |
|---|---|---|
| SAC | 2023.11 | 9,359.92 |
| ESPL | 2023.11 | 9,359.9 |
| A2C | 2023.11 | 9,359.81 |
| TD3 | 2023.11 | 9,359.25 |
| ACKTR | 2023.11 | 9,359.06 |
| PPO | 2023.11 | 9,356.59 |
| DDPG | 2023.11 | 9,347.1 |
| TRPO | 2023.11 | 9,188.43 |
| DSP | 2023.11 | 9,149.9 |
| Regression | 2023.11 | 637.2 |