Reinforcement Learning on Acrobot
[Chart: Average Returns over time, Jan 26, 2026 to May 8, 2026. Top entry: DTSemNet, -82.5. Metrics shown: Average Returns, Maximum Return.]
Updated 23d ago
Evaluation Results

Method     Details                    Date     Average Returns  Maximum Return
DTSemNet   Nf (Number of features...  2026.05  -82.5            -
DGT        Nf (Number of features...  2026.05  -83.1            -
VIPER      Nf (Number of features...  2026.05  -83.92           -
Deep RL    Nf (Number of features...  2026.05  -84              -
AC-SGD     batch size=1000, seeds=5   2026.01  -86.2            -
AC-Adam    batch size=1000, seeds=5   2026.01  -88.4            -
ICCT       Nf (Number of features...  2026.05  -88.6            -
AC-CG      batch size=1000, seeds=5   2026.01  -93.3            -
SMAC       batch size=1000, seeds=5   2026.01  -94.1            -
AC-KFAC    batch size=1000, seeds=5   2026.01  -170.7           -
PPO        Number of training see...  2026.03  -                63.5
A2C        Number of training see...  2026.03  -                213.6
DQN        Number of training see...  2026.03  -                61.9
CG-FPD     Number of training see...  2026.03  -                90.6
DF-CWP-CP  Number of training see...  2026.03  -                78.67