Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

hopper

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningHopper v5
Average Return3,732.5
93
Offline Reinforcement Learninghopper medium
Normalized Score3,729
52
Offline Reinforcement Learninghopper medium-replay
Normalized Score113
44
Offline Reinforcement LearningHopper D4RL v2 (offline)
Average Score100.8
32
Offline Reinforcement LearningHopper medium-expert
Normalized Score111.6
24
Offline Reinforcement Learninghopper Mixed Dataset
Normalized Reward108
24
LocomotionHopper IID (test)
Mean Episode Reward1,859
24
Locomotion ControlHopper sigma 0.3 (test)
Episode Reward1,368
24
LocomotionHopper
Convergence (%)100
20
Offline Reinforcement LearningHopper expert
Normalized Score112.8
19
Continuous ControlHopper 3-Dof
Final Return2,735
18
Locomotion ControlHopper sigma 0.7 (test)
Episode Reward443
18
Locomotion ControlHopper sigma 0.5 (test)
Episode Reward729
18
Locomotion ControlHopper sigma 0.1 (test)
Episode Reward1,859
18
Offline Reinforcement LearningHopper kinematic shifts
Score97
16
Offline Reinforcement LearningHopper
Average Return2,116.2
16
Reinforcement LearningHopper
Avg Episode Reward2,743.9
15
Continuous Controlhopper
Average Reward2,133,326
15
Offline Reinforcement LearningHopper Medium Noise 0
Normalized Return95
14
Offline Reinforcement LearningHopper Medium (Noise 5)
Normalized Return70.67
14
Cross-Domain Offline Policy Adaptationhopper-med Source Target
Normalized Score41.6
14
Offline Policy Adaptationhopper medium-expert
Normalized Score53.4
14
Offline Policy AdaptationHopper medium-replay
Normalized Score36.8
14
Offline Reinforcement LearningHopper random
Normalized Score32.2
14
Reinforcement LearningHopper v4
Average Return27,721,263
13
Showing 25 of 104 rows