Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Humanoid

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningHumanoid
Zero-Shot Reward90,921,063
30
Reinforcement LearningHumanoid v3
Avg Final Return11,888
26
Humanoid LocomotionHumanoid Randomized Task (OOD Sweep)
Reward-3.58
24
Continuous ControlHumanoid 17-Dof
Final Return13,860
21
Robot LocomotionHumanoid
Cumulative Reward5,299
16
Continuous ControlHumanoid MuJoCo v2 (evaluation)
Action Performance (p_act=0.1)5,078.3
14
Continuous ControlHumanoid v5
Average Return5,906.7
13
Worst-case time-constrained reinforcement learningHumanoid MuJoCo (test)
Normalized Worst-Case Reward4.02
12
Robot LocomotionHumanoid v1 (test)
Total Score93,123.84
12
Reinforcement LearningHumanoid v5
Performance Score5,906.7
11
LocomotionHumanoid
Relative Return Improvement18.52
10
Reinforcement LearningHumanoid v5
Coefficient of Variation (%)6.3
8
Reinforcement LearningHumanoid v5
Average Returns5,228
8
Constrained Reinforcement LearningHumanoid
Episodic Reward1,734.1
8
Reinforcement LearningHumanoid gravity v2
Average Return6,360
8
Continuous ControlHumanoid v4
Average Cumulative Reward4,978.5
7
Robotic ControlHumanoid v4
Local Optima Escape Rate72.3
7
Continuous ControlHumanoid
Humanoid Return (p_act=0.1)680.1
7
Continuous ControlHumanoid v3
Average Return4,963
7
LocomotionHumanoid v3
Average Return5,353.5
7
Reinforcement LearningHumanoid v2
Return8,048
7
LocomotionHumanoid v2
Average Return10,490
6
LocomotionHumanoid Environment Faults v5
Episodic Return198,257,932
5
LocomotionHumanoid Dynamic Faults v5
Episodic Return152,825,979
5
LocomotionHumanoid Actuator Faults v5
Episodic Return139,815,624
5
Showing 25 of 47 rows