Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo Hopper

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningMuJoCo Hopper v2
Average Return4,408
18
Continuous ControlMuJoCo Hopper logarithmic adversary v1
Average Performance Score2,577
12
Continuous ControlMuJoCo Hopper H=20
Normalized Return33.3
10
Continuous ControlMuJoCo Hopper H=10
Normalized Return13.2
10
Offline Reinforcement LearningMuJoCo Hopper Medium-Replay v2
Avg Normalized Score100.02
8
Offline Reinforcement LearningMuJoCo Hopper Medium-Expert v2
Avg Normalized Score107
7
Offline Reinforcement LearningMuJoCo Hopper Medium v2
Averaged Normalized Score102
7
Continuous ControlMuJoCo Hopper v4 (test)
Mean Episodic Return3,338
6
Continuous ControlMuJoCo Hopper 2-p v4
Normalized Return106
6
Continuous ControlMuJoCo Hopper 4-p v4
Normalized Return99
6
Continuous ControlMuJoCo Hopper v2 (train)
Mean Return3,713
6
Multi-objective Reinforcement LearningMuJoCo Hopper 2
Hypervolume (HV)22.09
5
Reinforcement LearningMuJoCo Hopper epsilon=0.075 (test)
Natural Return3,684
5
Offline Inverse Reinforcement LearningMuJoCo hopper (medium-exp)
Average Reward3,512.09
5
Offline Inverse Reinforcement LearningMuJoCo hopper (medium-replay)
Average Reward3,512.09
5
Offline Inverse Reinforcement LearningMuJoCo hopper medium
Average Reward3,512.09
5
Continuous ControlMuJoCo Hopper v3 (1M steps)
Average Return3,687
5
Continuous ControlMuJoCo Hopper v3 (500K steps)
Average Return3,548
5
Policy OptimizationMuJoCo Hopper H=40
Return71
5
Continuous ControlMuJoCo Hopper (H=40)
Normalized Return71
5
Multi-objective Reinforcement LearningMuJoCo Hopper-3
Hypervolume (HV)3.889
4
Mass GeneralizationMuJoCo Hopper 1.5–2.0× mass
Retention Rate62.4
4
Dynamics Model PredictionMuJoCo Hopper medium-replay v2 (test)
RMSE0.408
4
Dynamics Model PredictionMuJoCo Hopper expert v2 (test)
RMSE0.322
4
Dynamics Model PredictionMuJoCo Hopper medium v2 (train)
RMSE0.034
4
Showing 25 of 32 rows