Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Cart Pole

Benchmarks

Task NameDataset NameSOTA ResultTrend
Policy EvaluationCart-Pole off-policy perfect features
MSE0.17
7
Policy EvaluationCart-Pole on-policy, perfect features
MSE0.15
7
Policy EvaluationCart-Pole off-policy, impoverished features
MSE2.33
7
Policy EvaluationCart-Pole on-policy, impoverished features
MSE2.37
7
Reinforcement LearningCart-Pole Domain Generalization - Pole Length OpenAI Gym (3 held out domains)
Average Reward175.25
6
Reinforcement LearningCart-Pole OpenAI Gym (3 held-out domains (variable pole length and cart mass))
Return170.81
6
Classic ControlCart Pole OpenAI Gym (evaluation)
Mean Score178.3
5
Optimal ControlCart Pole
Final Cost17.49
2
End-to-end learning and planningCart-Pole Swing-Up
Cost (Best)87.78
1
Q-only open-loop forecastingCart-Pole windy
Metric-
0
Showing 10 of 10 rows