Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Half Cheetah

Benchmarks

Task NameDataset NameSOTA ResultTrend
LocomotionHalf Cheetah IID (test)
Mean Episode Reward2,278
24
Locomotion ControlHalf Cheetah sigma 0.3 (test)
Episode Reward1,577
24
Locomotion ControlHalf Cheetah sigma 0.7 (test)
Reward462
18
Locomotion ControlHalf Cheetah sigma 0.5 (test)
Episode Reward1,117
18
Locomotion ControlHalf Cheetah sigma 0.1 (test)
Episode Reward2,278
18
Offline Meta-Reinforcement LearningHalf-Cheetah-Vel sampled 10 unseen (test)
Average Return-48.4
10
Reinforcement LearningHalf-cheetah-velocity (train)
Runtime (hours)2
7
Robotic ControlHalf Cheetah
AP-3.58
6
LocomotionHalf-Cheetah Across-episode Reward and Dynamics Changes A-EP (R+D) (test)
Average Final Return-15.2
6
LocomotionHalf-Cheetah Across-episode Reward Changes A-EP (R) (test)
Avg Final Return-10.9
6
LocomotionHalf-Cheetah Within-episode Dynamics Changes W-EP (D) (test)
Average Final Return-48.2
6
LocomotionHalf-Cheetah Across-episode Agent Changes A-EP (A) (test)
Average Final Return-9.6
6
LocomotionHalf-Cheetah Across-episode Dynamics Changes A-EP (D) (test)
Avg Final Return-24.4
6
Imitation LearningHalf-Cheetah
Mean Score1,839.8
6
LocomotionHalf Cheetah sigma=0.7
Reward482
6
LocomotionHalf Cheetah sigma=0.5
Reward669
6
LocomotionHalf Cheetah sigma=0.1
Reward834
6
Trajectory OptimizationHalf Cheetah
Computational Time (s)26.4
5
Reinforcement Learninghalf-cheetah 10-p
Normalized Return80.5
4
Reinforcement Learninghalf-cheetah 2-p
Normalized Return44.9
4
Reinforcement Learninghalf-cheetah 4-p
Normalized Return54.7
4
Long-horizon predictionHalf Cheetah
NLL-2.8
4
LocomotionHalf-Cheetah Continuous Dynamics Changes CONT (D) (test)
Avg Final Return-12.3
4
LocomotionHalf Cheetah
Metric-
0
Showing 24 of 24 rows