Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

cartpole

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningCartPole v0
Mean Score200
48
Reinforcement LearningCartPole Pure
Average Reward (2/0.5)200
30
Reinforcement LearningCartPole v1 (test)
Total Reward500
25
Reinforcement LearningCartPole
Average Reward1,000
20
Continuous ControlCartpole
Median Samples1.63
10
Reinforcement LearningCartPole Pure-8-40
Average Reward (EpLen=2, DF=0.5)200
10
Reinforcement LearningCartPole Setting C v0 (test)
Performance (2/0.15)236.4
8
Reinforcement LearningCartPole Setting B v0 (test)
Steps Survived (Config 0.15)192.4
8
Reinforcement LearningCartPole Setting A v0 (test)
Performance Score (Config 2/0.85)98.2
8
Actuator InversionCartpole (Ceval-in)
AER659
8
Actuator InversionCartpole (train)
AER658
8
ControlCartpole swing-up
Median Samples111
8
Safe Reinforcement LearningSafe CartPole
Training Time (s)68.7
7
Reinforcement LearningSafe CartPole
Episode Reward200
7
Offline Reinforcement LearningCartPole 100k Gym
Returns100
6
Offline Reinforcement LearningCartPole Gym (10k)
Returns100
6
Offline Reinforcement LearningCartPole 1k Gym
Returns90
6
Reinforcement LearningCartPole (CP) (test)
Cumulative Reward500
6
Control TaskCartPole (test)
Average Reward494.09
6
Robustness EvaluationCartPole A=9.5 (test)
Average Reward231.8
6
Robustness EvaluationCartPole A=9.0 (test)
Average Reward830.9
6
Robustness EvaluationCartPole A=8.5 (test)
Average Reward810.1
6
Robustness EvaluationCartPole A=8.0 (test)
Average Reward1,000
6
Reinforcement LearningCartPole
Max Return500
5
Sensory-motor controlCartPole *2
Reward (First Iter, Worst)9.8
5
Showing 25 of 50 rows