Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Humanoid

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningHumanoid
Zero-Shot Reward90,921,063
32
Reinforcement LearningHumanoid v3
Avg Final Return11,888
26
Humanoid LocomotionHumanoid Randomized Task (OOD Sweep)
Reward-3.58
24
High-Dimensional Bayesian OptimizationHumanoid d = 6392
Rank1
21
Continuous ControlHumanoid 17-Dof
Final Return13,860
21
Robot LocomotionHumanoid
Cumulative Reward5,299
16
Continuous ControlHumanoid MuJoCo v2 (evaluation)
Action Performance (p_act=0.1)5,078.3
14
Continuous ControlHumanoid v5
Average Return5,906.7
13
Reinforcement LearningHumanoid (delta=[0.8^6, 0.5^6, 0.2^5], kappa=4.0) v5 (test)
Return5,620
12
Worst-case time-constrained reinforcement learningHumanoid MuJoCo (test)
Normalized Worst-Case Reward4.02
12
Robot LocomotionHumanoid v1 (test)
Total Score93,123.84
12
Reinforcement LearningHumanoid v5
Performance Score5,906.7
11
LocomotionHumanoid v4
Mean Episode Return7,365.7
10
LocomotionHumanoid
Relative Return Improvement18.52
10
Reinforcement LearningHumanoid v4
Reward5,715
9
Black-box OptimizationHumanoid
Objective Value669.52
8
High-Dimensional LocomotionHumanoid v4 (test)
Reward6,907.99
8
Reinforcement LearningHumanoid v5
Coefficient of Variation (%)6.3
8
Reinforcement LearningHumanoid v5
Average Returns5,228
8
Constrained Reinforcement LearningHumanoid
Episodic Reward1,734.1
8
Reinforcement LearningHumanoid gravity v2
Average Return6,360
8
Trajectory OptimizationHumanoid Standup
Computational Time (s)17.6
8
Continuous ControlHumanoid v4
Average Cumulative Reward4,978.5
7
Robotic ControlHumanoid v4
Local Optima Escape Rate72.3
7
Continuous ControlHumanoid
Humanoid Return (p_act=0.1)680.1
7
Showing 25 of 61 rows