Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo Walker

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningMuJoCo Walker
Average Return6,115
14
Continuous ControlMuJoCo Walker fixed random adversary L=0.1
Avg Performance5,278
12
Reinforcement LearningMuJoCo Walker (test)
Average Performance4,888
12
Continuous ControlMuJoCo Walker logarithmic adversary v1
Average Performance4,931
12
Inverse Reinforcement LearningMuJoCo Walker (test)
Average Performance5,423
4
LocomotionMuJoCo Walker (t = T)
Average Return755
3
LocomotionMuJoCo Walker t = 3T/4
Average Return631.6
3
LocomotionMuJoCo Walker t = 2T/3 (shift)
Average Return411.6
3
LocomotionMuJoCo Walker t = T/2 (shift)
Average Return323.8
3
Showing 9 of 9 rows