Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Acrobot

-86.2Average Returns

AC-SGD

-174.08-151.265-128.45-105.635Jan 26, 2026
Updated 18d ago

Evaluation Results

MethodLinks
2026.01
-86.2-
2026.01
-88.4-
2026.01
-93.3-
2026.01
-94.1-
2026.01
-170.7-
2026.03
-63.5
2026.03
-213.6
2026.03
-61.9
2026.03
-90.6
2026.03
-78.67