MuJoCo

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Time Series Reconstruction | MuJoCo (test) | MSE: 0.285 | 51 |
| Offline Reinforcement Learning | MujoCo halfcheetah | Normalized Return: 60.8 | 33 |
| Offline Reinforcement Learning | MuJoCo hopper D4RL (medium-replay) | Normalized Return: 101.6 | 26 |
| Continuous Control | MuJoCo Ant v4 | Normalized Return: 136 | 24 |
| Reinforcement Learning | MuJoCo HumanoidStandup | Average Performance: 130,892 | 24 |
| 3D Dynamics Prediction | MuJoCo Fall-and-rebound scenario | Translation Error (m): 0.0048 | 20 |
| Offline Reinforcement Learning | MuJoCo halfcheetah-medium-replay D4RL | Normalized Return: 54.1 | 20 |
| Offline Reinforcement Learning | MuJoCo walker2d-medium D4RL | Normalized Return: 88.2 | 20 |
| Offline Reinforcement Learning | MuJoCo halfcheetah-medium D4RL | Normalized Return: 65.6 | 20 |
| Continuous Control | MuJoCo Reacher v4 | Normalized Performance: 103 | 18 |
| Continuous Control | MuJoCo Pusher v4 | Normalized Performance: 1.36 | 18 |
| Continuous Control | MuJoCo HumanoidStandup v4 | Normalized Performance: 1.29 | 18 |
| Continuous Control | MuJoCo Hopper v4 | Normalized Performance: 1.25 | 18 |
| Offline Reinforcement Learning | MuJoCo halfcheetah-medium-expert D4RL | Normalized Return: 101.1 | 18 |
| Reinforcement Learning | MuJoCo Half-Cheetah | Average Return: 13,300 | 18 |
| HalfCheetah | Mujoco | Reward: 9.48 | 16 |
| Ant | MuJoCo | Recovery Time (%): 5.9 | 16 |
| Reinforcement Learning | MuJoCo Ant | Average Return: 7,889.1 | 14 |
| Reinforcement Learning | MuJoCo Hopper | Average Return: 3,876 | 14 |
| Offline Reinforcement Learning | MuJoCo hopper-medium D4RL | Normalized Return: 96.9 | 13 |
| Continuous Control | MuJoCo Hopper fixed random adversary L=0.1 | Average Performance: 2,365 | 12 |
| Reinforcement Learning | MuJoCo Hopper (test) | Average Reward: 1,946 | 12 |
| Reinforcement Learning | MuJoCo HalfCheetah (test) | Avg Performance: 8,174 | 12 |
| Continuous Control | MuJoCo v2 (test) | Ant Score: 1.78 | 12 |
| Continuous Control | MuJoCo Reacher | Average Reward: -3.85 | 12 |

Showing 25 of 91 rows