Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on InvertedDoublePendulum v3
Loading...
9,360
Average Final Return
DACER
6,134.96
6,972.23
7,809.5
8,646.77
May 24, 2024
Average Final Return
Updated 1mo ago
Evaluation Results
Method
Method
Links
Average Final Return
DACER
2024.05
9,360
DSAC
2024.05
9,360
SAC
2024.05
9,360
PPO
2024.05
9,356
TD3
2024.05
9,347
DDPG
2024.05
9,183
TRPO
2024.05
6,259
Feedback
Search any
task
Search any
task