
Half Cheetah

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Locomotion | Half Cheetah IID (test) | Mean Episode Reward: 2,278 | 24 |
| Locomotion Control | Half Cheetah sigma 0.3 (test) | Episode Reward: 1,577 | 24 |
| Locomotion Control | Half Cheetah sigma 0.7 (test) | Reward: 462 | 18 |
| Locomotion Control | Half Cheetah sigma 0.5 (test) | Episode Reward: 1,117 | 18 |
| Locomotion Control | Half Cheetah sigma 0.1 (test) | Episode Reward: 2,278 | 18 |
| Offline Meta-Reinforcement Learning | Half-Cheetah-Vel sampled 10 unseen (test) | Average Return: -48.4 | 10 |
| Reinforcement Learning | Half-cheetah-velocity (train) | Runtime (hours): 2 | 7 |
| Inverse Reinforcement Learning | Half Cheetah (Target) | Mean Cumulative Reward: 6,420.38 | 6 |
| Inverse Reinforcement Learning | Half Cheetah | Normalized Performance: 83 | 6 |
| Robotic Control | Half Cheetah | AP: -3.58 | 6 |
| Locomotion | Half-Cheetah Across-episode Reward and Dynamics Changes A-EP (R+D) (test) | Average Final Return: -15.2 | 6 |
| Locomotion | Half-Cheetah Across-episode Reward Changes A-EP (R) (test) | Avg Final Return: -10.9 | 6 |
| Locomotion | Half-Cheetah Within-episode Dynamics Changes W-EP (D) (test) | Average Final Return: -48.2 | 6 |
| Locomotion | Half-Cheetah Across-episode Agent Changes A-EP (A) (test) | Average Final Return: -9.6 | 6 |
| Locomotion | Half-Cheetah Across-episode Dynamics Changes A-EP (D) (test) | Avg Final Return: -24.4 | 6 |
| Imitation Learning | Half-Cheetah | Mean Score: 1,839.8 | 6 |
| Locomotion | Half Cheetah sigma=0.7 | Reward: 482 | 6 |
| Locomotion | Half Cheetah sigma=0.5 | Reward: 669 | 6 |
| Locomotion | Half Cheetah sigma=0.1 | Reward: 834 | 6 |
| Locomotion | Half Cheetah | Reward: 6,713.3 | 6 |
| Trajectory Optimization | Half Cheetah | Computational Time (s): 26.4 | 5 |
| Reinforcement Learning | half-cheetah 10-p | Normalized Return: 80.5 | 4 |
| Reinforcement Learning | half-cheetah 2-p | Normalized Return: 44.9 | 4 |
| Reinforcement Learning | half-cheetah 4-p | Normalized Return: 54.7 | 4 |
| Long-horizon prediction | Half Cheetah | NLL: -2.8 | 4 |
Showing 25 of 29 rows