cartpole

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Reinforcement Learning | CartPole v0 | Mean Score: 200 | 48 |
| Reinforcement Learning | CartPole Pure | Average Reward (2/0.5): 200 | 30 |
| Reinforcement Learning | CartPole | Average Reward: 1,000 | 29 |
| Reinforcement Learning | CartPole v1 (test) | Total Reward: 500 | 25 |
| Classic Discrete Control | CartPole v1 | Mean Episodic Return: 495 | 18 |
| Reinforcement Learning | CartPole v1 | Return: 354,122 | 16 |
| Imitation Learning | Cartpole v1 (test) | Optimality (%): 100 | 15 |
| Reinforcement Learning | CartPole | Wall-clock Training Time (min): 0.1 | 13 |
| Continuous Control | Cartpole | Median Samples: 1.63 | 10 |
| Reinforcement Learning | CartPole Pure-8-40 | Average Reward (EpLen=2, DF=0.5): 200 | 10 |
| Reinforcement Learning | CartPole | Max Return: 500 | 9 |
| Forecasting | CartPole Time-Varying (CP TV) (held-out trajectories) | MSE: 0.0007 | 8 |
| Forecasting | CartPole Time-Invariant (held-out trajectories) | MSE: 2.28 | 8 |
| Reinforcement Learning | CartPole Setting C v0 (test) | Performance (2/0.15): 236.4 | 8 |
| Reinforcement Learning | CartPole Setting B v0 (test) | Steps Survived (Config 0.15): 192.4 | 8 |
| Reinforcement Learning | CartPole Setting A v0 (test) | Performance Score (Config 2/0.85): 98.2 | 8 |
| Actuator Inversion | Cartpole (Ceval-in) | AER: 659 | 8 |
| Actuator Inversion | Cartpole (train) | AER: 658 | 8 |
| Control | Cartpole swing-up | Median Samples: 111 | 8 |
| Safe Reinforcement Learning | Safe CartPole | Training Time (s): 68.7 | 7 |
| Reinforcement Learning | Safe CartPole | Episode Reward: 200 | 7 |
| Offline Reinforcement Learning | CartPole 100k Gym | Returns: 100 | 6 |
| Offline Reinforcement Learning | CartPole Gym (10k) | Returns: 100 | 6 |
| Offline Reinforcement Learning | CartPole 1k Gym | Returns: 90 | 6 |
| Reinforcement Learning | CartPole (CP) (test) | Cumulative Reward: 500 | 6 |
Showing 25 of 63 rows