Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo Walker2d

Benchmarks

Task NameDataset NameSOTA ResultTrend
Continuous ControlMuJoCo Walker2d v4
Normalized Performance13,060
39
Offline Reinforcement LearningMuJoCo Walker2d Friction shift
Normalized Score76.96
32
Offline Reinforcement LearningMuJoCo Walker2d Gravity shift
Normalized Score69.48
32
Offline Reinforcement LearningMuJoCo walker2d medium-replay D4RL
Normalized Return94.1
20
Offline Reinforcement LearningMuJoCo walker2d medium-expert D4RL
Normalized Return116.6
18
Reinforcement LearningMuJoCo Walker2d v2
Average Return8,004
18
Reinforcement LearningMuJoCo Walker2d v5
Mean Episodic Return5,222
17
LocomotionMuJoCo Walker2d Medium-Replay D4RL
Average Normalized Score128.6
16
Continuous control locomotionMuJoCo Walker2d v3 (train)
Final Return6,482.6
12
Continuous ControlMuJoCo Walker2d (H=10)
Normalized Return14.9
10
LocomotionMuJoCo Walker2d Friction shift
Normalized Return40.8
8
LocomotionMuJoCo Walker2d Kinematic shift
Normalized Return56.4
8
LocomotionMuJoCo Walker2d Morphology shift
Normalized Return50.5
8
Offline Reinforcement LearningMuJoCo walker2d medium 1M
Final Score85.4
7
Reinforcement LearningMuJoCo Walker2d 1.5 density v1 (test)
Reward2,674
7
Continuous ControlMuJoCo Walker2d 10-p v4
Normalized Return102
6
Continuous ControlMuJoCo Walker2d 4-p v4
Normalized Return94.2
6
Continuous ControlMuJoCo Walker2d v2 (train)
Mean Return5,278
6
Reinforcement LearningSparse MuJoCo Walker2d v2 (test)
Max Return886.6
6
Reinforcement LearningMuJoCo Walker2d epsilon=0.05 (test)
Natural Return4,875
5
Offline Inverse Reinforcement LearningMuJoCo walker2d medium-exp
Average Reward5,383.98
5
Offline Inverse Reinforcement LearningMuJoCo walker2d (medium-replay)
Avg Reward5,383.98
5
Offline Inverse Reinforcement LearningMuJoCo walker2d medium
Avg Reward5,383.98
5
Continuous ControlMuJoCo Walker2d 1M steps v3
Average Return5,099
5
Continuous ControlMuJoCo Walker2d v3 (500K steps)
Average Return4,034
5
Showing 25 of 39 rows