Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Humanoid

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningHumanoid
Zero-Shot Reward90,921,063
30
Continuous ControlHumanoid 17-Dof
Final Return13,860
21
Robot LocomotionHumanoid
Cumulative Reward5,299
16
Worst-case time-constrained reinforcement learningHumanoid MuJoCo (test)
Normalized Worst-Case Reward4.02
12
Robot LocomotionHumanoid v1 (test)
Total Score93,123.84
12
Reinforcement LearningHumanoid v5
Performance Score5,906.7
11
Constrained Reinforcement LearningHumanoid
Episodic Reward1,734.1
8
Reinforcement LearningHumanoid gravity v2
Average Return6,360
8
Continuous ControlHumanoid v3
Average Return4,963
7
Continuous ControlHumanoid v5
Average Return5,906.7
7
LocomotionHumanoid v3
Average Return5,353.5
7
Reinforcement LearningHumanoid v3
Avg Final Return11,888
7
Reinforcement LearningHumanoid v2
Return8,048
7
LocomotionHumanoid v2
Average Return10,490
6
LocomotionHumanoid Environment Faults v5
Episodic Return198,257,932
5
LocomotionHumanoid Dynamic Faults v5
Episodic Return152,825,979
5
LocomotionHumanoid Actuator Faults v5
Episodic Return139,815,624
5
Motion in-betweeningHumanoid User Study (test)
Similar Score60.12
5
Continuous LocomotionHumanoid
Ground-truth Reward275.06
5
Trajectory OptimizationHumanoid Standup
Computational Time (s)17.6
5
Continuous ControlHumanoid Mujoco 1000k steps (train)
Training Time (h)11.43
4
Continuous ControlHumanoid Mujoco 500k steps (train)
Time (h)5.72
4
Continuous ControlHumanoid Mujoco 300k steps (train)
Time (h)3.43
4
Locomotion Diversity DiscoveryHumanoid visual input
Diversity Score0.71
3
Motion ImitationHumanoid Spinkick
Normalized Return77
3
Showing 25 of 31 rows