Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World

About

Bridging the 'reality gap' that separates simulated robotics from experiments on hardware could accelerate robotic research through improved data availability. This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator. With enough variability in the simulator, the real world may appear to the model as just another variation. We focus on the task of object localization, which is a stepping stone to general robotic manipulation skills. We find that it is possible to train a real-world object detector that is accurate to $1.5$cm and robust to distractors and partial occlusions using only data from a simulator with non-realistic random textures. To demonstrate the capabilities of our detectors, we show they can be used to perform grasping in a cluttered environment. To our knowledge, this is the first successful transfer of a deep neural network trained only on simulated RGB images (without pre-training on real images) to the real world for the purpose of robotic control.

Josh Tobin, Rachel Fong, Alex Ray, Jonas Schneider, Wojciech Zaremba, Pieter Abbeel• 2017

Related benchmarks

TaskDatasetResultRank
Reinforcement LearningMuJoCo HumanoidStandup
Average Performance1.11e+5
24
Reinforcement LearningMuJoCo Half-Cheetah
Average Return6.17e+3
18
NavigationMiniWorld FourRooms
Success Rate62
15
Reinforcement LearningMuJoCo Ant
Average Return7.50e+3
14
Reinforcement LearningMuJoCo Walker
Average Return4.61e+3
14
Reinforcement LearningMuJoCo Hopper
Average Return1.69e+3
14
Continuous ControlMuJoCo Ant fixed random adversary L=0.1
Average Performance7.57e+3
12
Reinforcement LearningAnt fixed linear adversary
Average Performance7.71e+3
12
Worst-case time-constrained reinforcement learningAnt MuJoCo (test)
Normalized Worst-Case Reward1.58
12
Continuous ControlMuJoCo v2 (test)
Ant Score1.65
12
Showing 10 of 123 rows
...

Other info

Follow for update