Share your thoughts, 1 month free Claude Pro on usSee more

Reinforcement Learning on DeepMind Control Acrobot Swingup (Steps to 75% Return)

475,000Steps to 75% Return

Mind Dreamer

Updated 2mo ago

Evaluation Results

Method	Links
Mind Dreamer 2026.05		475,000
Mind Dreamer 2026.05		475,400
Dreamer V2 2026.05		512,600
Mind Dreamer 2026.05		574,900
Dreamer V3 2026.05		985,400
Plan2Explore 2026.05		3,020,000