Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo Walker2d

Benchmarks

Task NameDataset NameSOTA ResultTrend
Continuous ControlMuJoCo Walker2d v4
Normalized Performance13,060
34
Offline Reinforcement LearningMuJoCo walker2d medium-replay D4RL
Normalized Return94.1
20
Offline Reinforcement LearningMuJoCo walker2d medium-expert D4RL
Normalized Return116.6
18
Reinforcement LearningMuJoCo Walker2d v2
Average Return8,004
18
Reinforcement LearningMuJoCo Walker2d v5
Mean Episodic Return5,222
17
Continuous ControlMuJoCo Walker2d (H=10)
Normalized Return14.9
10
Offline Reinforcement LearningMuJoCo walker2d medium 1M
Final Score85.4
7
Reinforcement LearningMuJoCo Walker2d 1.5 density v1 (test)
Reward2,674
7
LocomotionMuJoCo Walker2d Medium-Replay D4RL
Average Normalized Score85.3
6
Continuous ControlMuJoCo Walker2d 10-p v4
Normalized Return102
6
Continuous ControlMuJoCo Walker2d 4-p v4
Normalized Return94.2
6
Continuous ControlMuJoCo Walker2d v2 (train)
Mean Return5,278
6
Reinforcement LearningSparse MuJoCo Walker2d v2 (test)
Max Return886.6
6
Reinforcement LearningMuJoCo Walker2d epsilon=0.05 (test)
Natural Return4,875
5
Offline Inverse Reinforcement LearningMuJoCo walker2d medium-exp
Average Reward5,383.98
5
Offline Inverse Reinforcement LearningMuJoCo walker2d (medium-replay)
Avg Reward5,383.98
5
Offline Inverse Reinforcement LearningMuJoCo walker2d medium
Avg Reward5,383.98
5
Continuous ControlMuJoCo Walker2d 1M steps v3
Average Return5,099
5
Continuous ControlMuJoCo Walker2d v3 (500K steps)
Average Return4,034
5
Policy OptimizationMuJoCo Walker2d H=40
Return221.1
5
Policy OptimizationMuJoCo Walker2d H=20
Return60.7
5
Continuous ControlMuJoCo Walker2d (H=40)
Normalized Return221.1
5
Continuous ControlMuJoCo Walker2d (H=20)
Normalized Return60.7
5
Continuous ControlMuJoCo Walker2d v5 (test)
Average Return4,417
4
Off-dynamics Reinforcement LearningMuJoCo Walker2d 0.5 density dynamics shift (test)
Reward2,729
4
Showing 25 of 32 rows