
Reinforcement Learning on Acrobot

Average Returns: -82.5

DTSemNet

[Chart: Average Returns over time. x-axis: Jan 26, 2026 to May 8, 2026; y-axis ticks: -174.228, -150.414, -126.6, -102.786]
Updated 23d ago

Evaluation Results

Date      Average Returns   Links
2026.05   -82.5             -
2026.05   -83.1             -
2026.05   -83.92            -
2026.05   -84               -
2026.01   -86.2             -
2026.01   -88.4             -
2026.05   -88.6             -
2026.01   -93.3             -
2026.01   -94.1             -
2026.01   -170.7            -
2026.03   -63.5
2026.03   -213.6
2026.03   -61.9
2026.03   -90.6
2026.03   -78.67