Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pendulum

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement Learning ControlPendulum v1
Mean Score1,378.78
40
Reinforcement LearningPendulum
Avg Episode Reward-145.49
26
Reinforcement LearningPendulum v1 (test)
Average Return-164.82
16
RegressionPendulum (test)
MSE0.0034
14
Rollout predictionPendulum
Rollout MSE1.05
12
Continuous ControlPendulum
Median Samples5.6
12
Continuous ControlPendulum Nonmarkov v1 (test)
AUC@T-556.9
9
ControlPendulum v0
Median Samples21
9
Transition model estimationPendulum discretized n = 10^5
Failure Rate0
8
Image InterpolationPendulum (test)
MSE1
8
Reinforcement LearningPendulum classical control (1M steps)
Return-133.42
8
Continuous ControlPendulum v1
Average Cumulative Reward-152.4
7
Robotic ControlPendulum v1
Local Optima Escape Rate89.2
7
Reinforcement LearningPendulum PD-C (test)
Cumulative Reward854
6
Continuous Control (Negative Reward)Pendulum Pybullet
Mean Return9,124.6
6
Continuous Control (Positive Reward)Pendulum Pybullet
Return9,043.3
6
Continuous Control (Negative Reward)Pendulum Mujoco
Mean Return8,132.1
6
Continuous Control (Positive Reward)Pendulum Mujoco
Return9,358.4
6
MCTS Aggregation Strategy EvaluationPendulum
MRR0.75
6
angular velocity decoding (prediction)pendulum
R^20.727
6
angular velocity decoding (smoothing)pendulum
R-squared99.7
6
Reinforcement LearningPendulum
Average Decisions1,000
6
Continuous ControlPendulum
Action Repetition1.12
6
Property PredictionPendulum
Pendulum Angle1,555.33
6
Imitation LearningPendulum
Mean Score-179.6
6
Showing 25 of 51 rows