Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

cartpole

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningCartPole Pure
Average Reward (2/0.5)200
30
Reinforcement LearningCartPole v1 (test)
Total Reward500
25
Continuous ControlCartpole
Median Samples1.63
10
Reinforcement LearningCartPole Pure-8-40
Average Reward (EpLen=2, DF=0.5)200
10
Reinforcement LearningCartPole
Average Reward989.5
9
Actuator InversionCartpole (Ceval-in)
AER659
8
Actuator InversionCartpole (train)
AER658
8
ControlCartpole swing-up
Median Samples111
8
Reinforcement LearningCartPole v0
Mean Score199.2
8
Safe Reinforcement LearningSafe CartPole
Training Time (s)68.7
7
Reinforcement LearningSafe CartPole
Episode Reward200
7
Control TaskCartPole (test)
Average Reward494.09
6
Robustness EvaluationCartPole A=9.5 (test)
Average Reward231.8
6
Robustness EvaluationCartPole A=9.0 (test)
Average Reward830.9
6
Robustness EvaluationCartPole A=8.5 (test)
Average Reward810.1
6
Robustness EvaluationCartPole A=8.0 (test)
Average Reward1,000
6
Sensory-motor controlCartPole *2
Reward (First Iter, Worst)9.8
5
Sensory-motor controlCartPole
Reward (First Iteration, Worst Rep)42.15
5
Reinforcement Learningcartpole
Return200
5
Reinforcement LearningCartPole v1
Return354,122
5
Reinforcement LearningCartPole 10% action noise (test)
Return (Noisy)279
4
Reinforcement LearningCartPole Clean (test)
Clean Return354,122
4
Neural Network Verificationcartpole
Time (s)142
3
Reinforcement LearningCartPole
Total Episodes2,000,000
3
Meta-Reinforcement LearningCartpole fl-ood
FLOPs (k)0.004
3
Showing 25 of 35 rows