Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Pusher v2

-19Average Final Return

DACER

-30.44-27.47-24.5-21.53May 24, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.05
-19
2024.05
-19
2024.05
-20
2024.05
-21
2024.05
-23
2024.05
-23
2024.05
-30