Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Pusher v2
Loading...
-19
Average Final Return
DACER
-30.44
-27.47
-24.5
-21.53
May 24, 2024
Average Final Return
Updated 1mo ago
Evaluation Results
Method
Method
Links
Average Final Return
DACER
2024.05
-19
DSAC
2024.05
-19
SAC
2024.05
-20
TD3
2024.05
-21
TRPO
2024.05
-23
PPO
2024.05
-23
DDPG
2024.05
-30
Feedback
Search any
task
Search any
task