Pendulum

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Reinforcement Learning | Pendulum v1 (test) | Average Return: -164.82 | 16 |
| Reinforcement Learning | Pendulum | Avg Episode Reward: -145.49 | 15 |
| Regression | Pendulum (test) | MSE: 0.0034 | 14 |
| Continuous Control | Pendulum | Median Samples: 5.6 | 12 |
| Continuous Control | Pendulum Nonmarkov v1 (test) | AUC@T: -556.9 | 9 |
| Control | Pendulum v0 | Median Samples: 21 | 9 |
| Transition model estimation | Pendulum discretized n = 10^5 | Failure Rate: 0 | 8 |
| Image Interpolation | Pendulum (test) | MSE: 1 | 8 |
| Reinforcement Learning | Pendulum classical control (1M steps) | Return: -133.42 | 8 |
| Continuous Control (Negative Reward) | Pendulum Pybullet | Mean Return: 9,124.6 | 6 |
| Continuous Control (Positive Reward) | Pendulum Pybullet | Return: 9,043.3 | 6 |
| Continuous Control (Negative Reward) | Pendulum Mujoco | Mean Return: 8,132.1 | 6 |
| Continuous Control (Positive Reward) | Pendulum Mujoco | Return: 9,358.4 | 6 |
| MCTS Aggregation Strategy Evaluation | Pendulum | MRR: 0.75 | 6 |
| angular velocity decoding (prediction) | pendulum | R^2: 0.727 | 6 |
| angular velocity decoding (smoothing) | pendulum | R-squared: 99.7 | 6 |
| Reinforcement Learning | Pendulum | Average Decisions: 1,000 | 6 |
| Continuous Control | Pendulum | Action Repetition: 1.12 | 6 |
| Property Prediction | Pendulum | Pendulum Angle: 1,555.33 | 6 |
| Imitation Learning | Pendulum | Mean Score: -179.6 | 6 |
| Forecasting | Pendulum | MSE: 0.283 | 5 |
| Causal Representation Learning | Pendulum | MIC: 98.94 | 5 |
| System Identification | Pendulum (test) | Average MSE: 0.72 | 5 |
| Regression | Pendulum | MSE: 3.41 | 5 |
| Offline Decision Making | Pendulum visual | Average Return: -155.4 | 4 |

Showing 25 of 43 rows