Walker

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Reinforcement Learning | Walker | Average Returns: 1,035.52 | 38 |
| Locomotion | Walker IID (test) | Mean Episode Reward: 1,909 | 24 |
| Locomotion Control | Walker sigma 0.1 (test) | Episode Reward: 1,909 | 24 |
| Offline Reinforcement Learning | Walker Gym-MuJoCo Medium-Expert D4RL | Normalized Score: 111.6 | 18 |
| Locomotion Control | Walker sigma 0.7 (test) | Episode Reward: 289 | 18 |
| Locomotion Control | Walker sigma 0.5 (test) | Episode Reward: 518 | 18 |
| Locomotion Control | Walker sigma 0.3 (test) | Episode Reward: 908 | 18 |
| Offline Reinforcement Learning | Walker Medium Gym-MuJoCo D4RL | Normalized Score: 84.7 | 16 |
| Reinforcement Learning | Walker fixed linear adversary | Average Performance: 5,256 | 12 |
| Worst-case time-constrained reinforcement learning | Walker MuJoCo (test) | Normalized Worst-Case Reward: 1.69 | 12 |
| Robust Reinforcement Learning | Walker fixed exponential adversary MuJoCo | Average Performance: 5,310 | 12 |
| Continuous Control | Walker MuJoCo (test) | Worst-case Performance: 5,724 | 12 |
| Robot Locomotion | Walker v1 (test) | Total Reward: 2,603.59 | 12 |
| Hurdles | Walker robot | Average Return: 15.3 | 9 |
| Actuator Inversion | Walker (Ceval-in) | AER: 849 | 8 |
| Actuator Inversion | Walker C (train) | AER: 845 | 8 |
| Robotic Control | Walker V | Average Return: 61,227 | 6 |
| Robotic Control | Walker-P | Average Return: 1,123,176 | 6 |
| Locomotion | Walker sigma=0.7 | Reward: 504 | 6 |
| Locomotion | Walker sigma=0.5 | Reward: 743 | 6 |
| Locomotion | Walker sigma=0.3 | Reward: 887 | 6 |
| Robotic control optimization | Walker | Generations: 10 | 5 |
| Reinforcement Learning | Walker-P | Time Cost: 3 | 5 |
| Reinforcement Learning | walker 10-p | Normalized Return: 102 | 4 |
| Reinforcement Learning | walker 2-p | Normalized Return: 130.6 | 4 |
Showing 25 of 27 rows