Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo Humanoid

Benchmarks

Task NameDataset NameSOTA ResultTrend
Continuous ControlMuJoCo Humanoid v4
Normalized Performance (Ret_nor)115
18
Reinforcement LearningMuJoCo Humanoid v2
Average Return10,490
18
Reinforcement LearningMuJoCo Humanoid
Average Return10,249
12
Continuous ControlMuJoCo Humanoid v4 (test)
Mean Episodic Return11,940
6
Mass GeneralizationMuJoCo Humanoid 1.5–2.0× mass
Retention Rate91.1
6
Continuous ControlMuJoCo Humanoid v2 (train)
Mean Return6,242
6
Reinforcement LearningMuJoCo Humanoid v2 (test)
Max Avg Return9,080.54
6
Continuous ControlMuJoCo Humanoid v5 (test)
Average Return5,701.2
4
Meta-Reinforcement LearningMuJoCo Humanoid Body variation (test)
CVaR 0.05 Return1,365
2
Meta-Reinforcement LearningMuJoCo Humanoid Mass variation (test)
CVaR 0.05 Return1,378
2
Meta-Reinforcement LearningMuJoCo Humanoid Velocity variation (test)
CVaR 0.05 Return833
2
Continuous control locomotionMuJoCo Humanoid v3 (train)
Avg Performance (1M Steps)665
2
Showing 12 of 12 rows