
DeepMind Control

Benchmarks

| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Reinforcement Learning | DeepMind Control Cartpole Balance Sparse | Steps to 75% Return | 150,700 | 11 |
| Behavior Cloning | DeepMind Control (DMC) suite seen/unseen embodiments | Hopper Hop Score | 49.1 | 9 |
| Reinforcement Learning | DeepMind Control Reacher Hard | Steps to 75% Return (k) | 589.7 | 8 |
| Vision-based robot policy learning | DeepMind Control | Stand Performance | 89.1 | 8 |
| Reinforcement Learning | DeepMind Control Walker Walk | Steps to 75% Return | 116.8 | 6 |
| Reinforcement Learning | DeepMind Control Walker Stand | Steps to 75% Return | 120 | 6 |
| Reinforcement Learning | DeepMind Control Reacher Easy | Steps to 75% Return Threshold | 205,000 | 6 |
| Reinforcement Learning | DeepMind Control Quadruped Walk | Steps to Reach 75% Return | 330,500 | 6 |
| Reinforcement Learning | DeepMind Control Pendulum Swingup | Steps to 75% Return (k) | 76,600 | 6 |
| Reinforcement Learning | DeepMind Control Hopper Hop | Steps to 75% Return | 153,500 | 6 |
| Reinforcement Learning | DeepMind Control Finger Turn Hard | Steps to 75% Return | 333.4 | 6 |
| Reinforcement Learning | DeepMind Control Finger Turn Easy | Steps to 75% Return | 77,300 | 6 |
| Reinforcement Learning | DeepMind Control Finger Spin | Steps to 75% Return | 89,100 | 6 |
| Reinforcement Learning | DeepMind Control Cup Catch | Steps to 75% Return | 104,700 | 6 |
| Reinforcement Learning | DeepMind Control Cartpole Swingup | Steps to 75% Return | 278,800 | 6 |
| Reinforcement Learning | DeepMind Control Cartpole Balance | Steps to 75% Return | 203,100 | 6 |
| Reinforcement Learning | DeepMind Control Acrobot Swingup | Steps to 75% Return | 475,000 | 6 |
| Reinforcement Learning | DeepMind Control Walker Run | Steps to 75% Return | 205,000 | 5 |
| Reinforcement Learning | DeepMind Control Hopper Stand | Steps to 75% Return (k) | 245.2 | 5 |
| Reinforcement Learning | DeepMind Control Cheetah Run | Environment Steps to 75% Return (k) | 145,400 | 5 |
| Reinforcement Learning | DeepMind Control Cartpole Swingup Sparse | Environment Steps to 75% Return | 383,100 | 5 |
| Continuous Control | DeepMind Control 3M steps | Acrobot Swingup Score | 422 | 5 |
| Continuous Control | DeepMind Control 1.5M steps | Acrobot Swingup Score | 272 | 5 |
| Reinforcement Learning | DeepMind Control Quadruped Run | Steps to 75% Return | 345,000 | 4 |
| Continuous Control | DeepMind Control | Walker-stand Success Rate | 89.1 | 4 |
Showing 25 of 30 rows