
Reinforcement Learning on DeepMind Control Acrobot Swingup (Steps to 75% Return)

475,000 Steps to 75% Return

Mind Dreamer

[Chart: Steps to 75% Return; axis ticks 373,200, 1,060,350, 1,747,500, 2,434,650; date May 15, 2026]
Updated 16d ago

Evaluation Results

Method    Links
2026.05    475,000
2026.05    475,400
2026.05    512,600
2026.05    574,900
2026.05    985,400
           3,020,000