
PUSH

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Humanoid Locomotion | Push In-distribution (deterministic evaluation) | Cumulative Reward: 5.01 | 4 |
| Reinforcement Learning | push 10-p | Normalized Return: 84.1 | 4 |
| Reinforcement Learning | push 2-p | Normalized Return: 95.2 | 4 |
| Reinforcement Learning | push 4-p | Normalized Return: 92.4 | 4 |
| Video Prediction | PUSH1 | FVD: 630.4 | 4 |