Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Acrobot v1

-140.16Mean Return

GB-DQN

-8,327.9136-6,202.2468-4,076.58-1,950.9132Dec 18, 2025Dec 26, 2025Jan 3, 2026Jan 11, 2026Jan 19, 2026Jan 27, 2026Feb 5, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
-140.1688.52-
2026.02
-147.1671.06-
2025.12
-149.5985.72-
2025.12
-154.8285.56-
2026.02
-156.982.4-
2026.02
-164.6984.03-
2025.12
-166.3297.39-
-172.6106.6-
2025.12
-264.58130.77-
2026.02
-49819.9-
2025.12
-5,000-0
2025.12
-7,518-85
2025.12
-7,715-90
2025.12
-8,013-95