Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pendulum

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement Learning ControlPendulum v1
Mean Score1,378.78
40
Reinforcement LearningPendulum
Avg Episode Reward-145.49
26
State EstimationPendulum
MAE0.7
21
Reinforcement LearningPendulum v1 (test)
Average Return-164.82
16
RegressionPendulum (test)
MSE0.0034
14
Continuous ControlPendulum
Robustness Gap0.72
12
Rollout predictionPendulum
Rollout MSE1.05
12
Continuous ControlPendulum
Median Samples5.6
12
Continuous ControlPendulum v1
Average Cumulative Reward-150.8
11
RegressionPendulum
MSE3.32
11
Parameter EstimationPendulum 90cm
Length (m)1.07
9
Continuous ControlPendulum Nonmarkov v1 (test)
AUC@T-556.9
9
ControlPendulum v0
Median Samples21
9
Transition model estimationPendulum discretized n = 10^5
Failure Rate0
8
Image InterpolationPendulum (test)
MSE1
8
Reinforcement LearningPendulum classical control (1M steps)
Return-133.42
8
Dynamical IdentificationPendulum Numerical
AUC99
7
Robotic ControlPendulum v1
Local Optima Escape Rate89.2
7
Counterfactual GenerationPendulum (test)
MAE (Pendulum (p) | do(p))0.013
6
Reinforcement LearningPendulum
Steps (Mean)80,240
6
Reinforcement LearningPendulum PD-C (test)
Cumulative Reward854
6
Continuous Control (Negative Reward)Pendulum Pybullet
Mean Return9,124.6
6
Continuous Control (Positive Reward)Pendulum Pybullet
Return9,043.3
6
Continuous Control (Negative Reward)Pendulum Mujoco
Mean Return8,132.1
6
Continuous Control (Positive Reward)Pendulum Mujoco
Return9,358.4
6
Showing 25 of 70 rows