Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Acrobot
Loading...
-86.2
Average Returns
AC-SGD
-174.08
-151.265
-128.45
-105.635
Jan 26, 2026
Average Returns
Maximum Return
Updated 18d ago
Evaluation Results
Method
Method
Links
Average Returns
Maximum Return
AC-SGD
batch size=1000, seeds=5
2026.01
-86.2
-
AC-Adam
batch size=1000, seeds=5
2026.01
-88.4
-
AC-CG
batch size=1000, seeds=5
2026.01
-93.3
-
SMAC
batch size=1000, seeds=5
2026.01
-94.1
-
AC-KFAC
batch size=1000, seeds=5
2026.01
-170.7
-
PPO
Number of training see...
2026.03
-
63.5
A2C
Number of training see...
2026.03
-
213.6
DQN
Number of training see...
2026.03
-
61.9
CG-FPD
Number of training see...
2026.03
-
90.6
DF-CWP-CP
Number of training see...
2026.03
-
78.67
Feedback
Search any
task
Search any
task