Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BipedalWalker

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningBipedalWalker
Average Episode Reward314.24
20
Continuous ControlBipedalWalker v3
Episodic Cumulative Reward298.4
15
LocomotionBipedalWalker Overall Mean
Mean Return89.95
11
LocomotionBipedalWalker Roughness terrain
Mean Return224.4
11
LocomotionBipedalWalker Stump terrain
Mean Return34.16
11
LocomotionBipedalWalker PitGap terrain
Mean Return-7.65
11
LocomotionBipedalWalker Stairs terrain
Mean Return-0.66
11
LocomotionBipedalWalker Hardcore terrain
Mean Return86.83
11
LocomotionBipedalWalker Basic terrain
Mean Return293.67
11
Solved RateBipedalWalker Zero-Shot (test)
Basic Solved Rate100
9
Reinforcement LearningBipedalWalker
Training Time (h)18.38
9
Continuous ControlBipedalWalker Nonmarkov v3
AUC@T184.7
9
Robotic ControlBipedalWalker v3
Local Optima Escape Rate83.5
7
Environment InteractionBipedalWalker
Environment Steps (M)347
7
Adaptability EvaluationBipedalWalker mass variations (test)
AUC295.65
6
Adaptability EvaluationBipedalWalker friction variations (test)
AUC1,429.66
6
Quality-DiversityBipedalWalker
GT QD Score6.09
6
Reinforcement LearningBipedalWalker v3
Return273.2
6
Reinforcement Learningbipedalwalker Sticky
AUC@T42,687,915.83
2
Reinforcement Learningbipedalwalker Noisy
AUC@T32,301,685.83
2
Reinforcement Learningbipedalwalker (Clean)
AUC@T9,392,950.11
2
Reinforcement LearningBipedalWalker standard (test)
Length17
2
Interpretability EvaluationBipedalWalker
Interpretability Score3.2
2
Showing 23 of 23 rows