Reinforcement Learning on DeepMind Control Acrobot Swingup (Steps to 75% Return)
[Chart: Steps to 75% Return over time. Highlighted entry: Mind Dreamer, 475,000 (May 15, 2026).]
Updated 16d ago
Evaluation Results
Method        Configuration  Date     Steps to 75% Return
Mind Dreamer  Horizon=5      2026.05              475,000
Mind Dreamer  Horizon=15     2026.05              475,400
Dreamer V2    -              2026.05              512,600
Mind Dreamer  Horizon=10     2026.05              574,900
Dreamer V3    -              2026.05              985,400
Plan2Explore  -              2026.05            3,020,000
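
The page does not say how "Steps to 75% Return" is computed. As a rough illustration only, the sketch below assumes the metric is the first logged environment step at which the evaluation return reaches 75% of a reference return. The reference value of 1000, the function name `steps_to_fraction_of_return`, and the sample learning curve are all hypothetical and are not taken from the benchmark.

```python
from typing import Optional, Sequence


def steps_to_fraction_of_return(
    steps: Sequence[int],
    returns: Sequence[float],
    reference_return: float = 1000.0,  # assumed reference, not stated on the page
    fraction: float = 0.75,
) -> Optional[int]:
    """Return the first logged step whose return meets the threshold.

    Returns None if the curve never reaches fraction * reference_return.
    """
    if len(steps) != len(returns):
        raise ValueError("steps and returns must have the same length")
    threshold = fraction * reference_return
    for step, ret in zip(steps, returns):
        if ret >= threshold:
            return step
    return None


# Made-up learning curve, for illustration only.
curve_steps = [100_000, 200_000, 300_000, 400_000, 500_000]
curve_returns = [50.0, 210.0, 480.0, 700.0, 790.0]

print(steps_to_fraction_of_return(curve_steps, curve_returns))  # 500000
```

Under this reading, a smaller value means the method reached the threshold with fewer environment steps, which matches the ascending order of the table.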