Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pusher

Benchmarks

Task NameDataset NameSOTA ResultTrend
PusherPusher
Convergence Rate100
20
Reinforcement LearningPusher
Average Returns39.88
10
Reinforcement LearningPusher v2
Average Final Return-19
7
Continuous ControlPusher v5
Final Return-30.4
6
PusherPusher
Metric-
0
Showing 5 of 5 rows