Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning Control on Gymnasium Pendulum v1
Loading...
0.397
MSI (s)
DQN
0.07772
0.16061
0.2435
0.32639
May 11, 2026
MSI (s)
Updated 20d ago
Evaluation Results
Method
Method
Links
MSI (s)
DQN
wc=10
2026.05
0.397
Pref-DQN (1 model)
wc=16
2026.05
0.393
SAC
wc=10
2026.05
0.379
Classical STC (B3)
2026.05
0.202
PPO
2026.05
0.09
Feedback
Search any
task
Search any
task