Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
About
Bridging the 'reality gap' that separates simulated robotics from experiments on hardware could accelerate robotic research through improved data availability. This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator. With enough variability in the simulator, the real world may appear to the model as just another variation. We focus on the task of object localization, which is a stepping stone to general robotic manipulation skills. We find that it is possible to train a real-world object detector that is accurate to $1.5$cm and robust to distractors and partial occlusions using only data from a simulator with non-realistic random textures. To demonstrate the capabilities of our detectors, we show they can be used to perform grasping in a cluttered environment. To our knowledge, this is the first successful transfer of a deep neural network trained only on simulated RGB images (without pre-training on real images) to the real world for the purpose of robotic control.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Reinforcement Learning | MuJoCo HumanoidStandup | Average Performance1.11e+5 | 24 | |
| Reinforcement Learning | MuJoCo Half-Cheetah | Average Return6.17e+3 | 18 | |
| Navigation | MiniWorld FourRooms | Success Rate62 | 15 | |
| Reinforcement Learning | MuJoCo Ant | Average Return7.50e+3 | 14 | |
| Reinforcement Learning | MuJoCo Walker | Average Return4.61e+3 | 14 | |
| Reinforcement Learning | MuJoCo Hopper | Average Return1.69e+3 | 14 | |
| Continuous Control | MuJoCo Ant fixed random adversary L=0.1 | Average Performance7.57e+3 | 12 | |
| Reinforcement Learning | Ant fixed linear adversary | Average Performance7.71e+3 | 12 | |
| Worst-case time-constrained reinforcement learning | Ant MuJoCo (test) | Normalized Worst-Case Reward1.58 | 12 | |
| Continuous Control | MuJoCo v2 (test) | Ant Score1.65 | 12 |