Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo

Benchmarks

Task NameDataset NameSOTA ResultTrend
Time Series ReconstructionMuJoCo (test)
MSE0.285
51
Offline Reinforcement LearningMujoCo halfcheetah
Normalized Return60.8
33
Continuous ControlMuJoCo Hopper v4
Normalized Performance3,592
28
Reinforcement LearningMuJoCo Half-Cheetah
Average Return13,907
28
Offline Reinforcement LearningMuJoCo hopper D4RL (medium-replay)
Normalized Return101.6
26
Offline Reinforcement LearningMuJoCo walker2d-medium D4RL
Normalized Return92.5
26
Continuous ControlMuJoCo Ant
Average Reward6,336
26
Continuous ControlMuJoCo HalfCheetah
Average Reward13,144
25
Continuous ControlMuJoCo Ant v4
Normalized Return136
24
Reinforcement LearningMuJoCo HumanoidStandup
Average Performance130,892
24
Reinforcement LearningMuJoCo Ant
Average Return7,889.1
24
Reinforcement LearningMuJoCo Hopper
Average Return3,876
24
3D Dynamics PredictionMuJoCo Fall-and-rebound scenario
Translation Error (m)0.0048
20
Offline Reinforcement LearningMuJoCo halfcheetah-medium-replay D4RL
Normalized Return54.1
20
Offline Reinforcement LearningMuJoCo halfcheetah-medium D4RL
Normalized Return65.6
20
Continuous ControlMuJoCo Reacher v4
Normalized Performance103
18
Continuous ControlMuJoCo Pusher v4
Normalized Performance1.36
18
Continuous ControlMuJoCo HumanoidStandup v4
Normalized Performance1.29
18
Offline Reinforcement LearningMuJoCo halfcheetah-medium-expert D4RL
Normalized Return101.1
18
Continuous ControlMuJoCo Reacher
Average Reward6.22
18
Reinforcement LearningMuJoCo Hopper v5
Mean Episodic Return3,268
17
HalfCheetahMujoco
Reward9.48
16
AntMuJoCo
Recovery Time (%)5.9
16
Imitation LearningMuJoCo
Hopper Reward109.46
15
Offline Reinforcement LearningMuJoCo walker2d-medium 10K
Score76.3
13
Showing 25 of 144 rows