Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Acrobot

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningAcrobot v1
Mean Return-140.16
14
Continuous ControlAcrobot Nonmarkov v1 (test)
AUC@T-82.7
9
Identifying Optimal TrajectoriesAcrobot v1 (top-5 trajectories)
Average Trajectory Length73.2
6
Reinforcement LearningAcrobot
Average Returns-86.2
5
Acrobot ControlAcrobot
Success Rate100
4
Reinforcement LearningAcrobot standard (train/test)
Model Parameters850
4
Sensory-motor controlAcrobot Gymnasium
Mean Best Reward-77.3
2
Reinforcement Learningacrobot Sticky
AUC@T-84,528,355.1
2
Reinforcement Learningacrobot Noisy
AUC @ T-80,111,477.55
2
Reinforcement Learningacrobot Clean
AUC@T-79,144,263.52
2
Continuous ControlAcrobot Nonmarkov v1
AUC@T-
0
Showing 11 of 11 rows