Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BipedalWalker

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningBipedalWalker
Average Episode Reward314.24
10
Continuous ControlBipedalWalker Nonmarkov v3
AUC@T184.7
9
Continuous ControlBipedalWalker v3
Episodic Cumulative Reward276.98
8
Reinforcement LearningBipedalWalker v3
Return180.58
2
Reinforcement Learningbipedalwalker Sticky
AUC@T42,687,915.83
2
Reinforcement Learningbipedalwalker Noisy
AUC@T32,301,685.83
2
Reinforcement Learningbipedalwalker (Clean)
AUC@T9,392,950.11
2
Reinforcement LearningBipedalWalker standard (test)
Length17
2
Interpretability EvaluationBipedalWalker
Interpretability Score3.2
2
Showing 9 of 9 rows