Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

hopper

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningHopper v5
Average Return3,732.5
101
Offline Reinforcement Learninghopper medium
Normalized Score3,729
58
Offline Reinforcement Learninghopper medium-replay
Normalized Score113
44
Offline Reinforcement LearningHopper D4RL v2 (offline)
Average Score100.8
32
Offline Reinforcement LearningHopper Medium JointNoise Shift
Average Return109.803
27
Offline Reinforcement LearningHopper Medium BodyMass Shift
Average Return82.786
27
Offline Reinforcement Learning1T10S Hopper (Medium-Expert)
Score111.587
26
Offline Reinforcement Learning1T10S Hopper (Medium-Replay)
Score98.988
26
Offline Reinforcement LearningHopper 1T10S (Medium)
Score101.244
26
Reinforcement LearningHopper v3
Average Final Return4,104
26
Offline Reinforcement LearningHopper medium-expert
Normalized Score111.6
24
Offline Reinforcement Learninghopper Mixed Dataset
Normalized Reward108
24
LocomotionHopper IID (test)
Mean Episode Reward1,859
24
Locomotion ControlHopper sigma 0.3 (test)
Episode Reward1,368
24
LocomotionHopper
Convergence (%)100
20
Offline Reinforcement LearningHopper expert
Normalized Score112.8
19
Offline Reinforcement LearningHopper Medium-Expert BodyMass Shift
Average Return77.279
18
Offline Reinforcement LearningHopper Medium-Replay JointNoise Shift
Average Return93.704
18
Offline Reinforcement LearningHopper Medium-Expert 1T10S
Average Return109.803
18
Offline Reinforcement LearningHopper Medium-Replay 1T10S
Average Return93.704
18
Offline Reinforcement LearningHopper Medium 1T10S
Average Return78.325
18
Continuous ControlHopper 3-Dof
Final Return2,735
18
Locomotion ControlHopper sigma 0.7 (test)
Episode Reward443
18
Locomotion ControlHopper sigma 0.5 (test)
Episode Reward729
18
Locomotion ControlHopper sigma 0.1 (test)
Episode Reward1,859
18
Showing 25 of 133 rows