Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MuJoCo Hopper

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningMuJoCo Hopper v2
Average Return4,408
18
Continuous ControlMuJoCo Hopper logarithmic adversary v1
Average Performance Score2,577
12
Continuous ControlMuJoCo Hopper H=20
Normalized Return33.3
10
Continuous ControlMuJoCo Hopper H=10
Normalized Return13.2
10
Offline Reinforcement LearningMuJoCo Hopper Medium-Replay v2
Avg Normalized Score100.02
8
Offline Reinforcement LearningMuJoCo Hopper Medium-Expert v2
Avg Normalized Score107
7
Offline Reinforcement LearningMuJoCo Hopper Medium v2
Averaged Normalized Score102
7
Continuous ControlMuJoCo Hopper 2-p v4
Normalized Return106
6
Continuous ControlMuJoCo Hopper 4-p v4
Normalized Return99
6
Continuous ControlMuJoCo Hopper v2 (train)
Mean Return3,713
6
Reinforcement LearningMuJoCo Hopper epsilon=0.075 (test)
Natural Return3,684
5
Offline Inverse Reinforcement LearningMuJoCo hopper (medium-exp)
Average Reward3,512.09
5
Offline Inverse Reinforcement LearningMuJoCo hopper (medium-replay)
Average Reward3,512.09
5
Offline Inverse Reinforcement LearningMuJoCo hopper medium
Average Reward3,512.09
5
Continuous ControlMuJoCo Hopper v3 (1M steps)
Average Return3,687
5
Continuous ControlMuJoCo Hopper v3 (500K steps)
Average Return3,548
5
Policy OptimizationMuJoCo Hopper H=40
Return71
5
Continuous ControlMuJoCo Hopper (H=40)
Normalized Return71
5
Dynamics Model PredictionMuJoCo Hopper medium-replay v2 (test)
RMSE0.408
4
Dynamics Model PredictionMuJoCo Hopper expert v2 (test)
RMSE0.322
4
Dynamics Model PredictionMuJoCo Hopper medium v2 (train)
RMSE0.034
4
Policy GradientMuJoCo Hopper (final 20 iterations)
Average Return185.9
3
Continuous control locomotionMuJoCo Hopper v3 (train)
Avg Performance (1M Steps)2,544
2
Reinforcement LearningMuJoCo Hopper v4
Metric-
0
Showing 24 of 24 rows