Active Domain Randomization
About
Domain randomization is a popular technique for improving domain transfer, often used in a zero-shot setting when the target domain is unknown or cannot easily be used for training. In this work, we empirically examine the effects of domain randomization on agent generalization. Our experiments show that domain randomization may lead to suboptimal, high-variance policies, which we attribute to the uniform sampling of environment parameters. We propose Active Domain Randomization, a novel algorithm that learns a parameter sampling strategy. Our method looks for the most informative environment variations within the given randomization ranges by leveraging the discrepancies of policy rollouts in randomized and reference environment instances. We find that training more frequently on these instances leads to better overall agent generalization. Our experiments across various physics-based simulated and real-robot tasks show that this enhancement leads to more robust, consistent policies.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Peg Insertion | Real-world | Success Rate55.8 | 25 | |
| Pick-&-Place | Real-world | Success Rate78.2 | 15 | |
| Tool Usage | Real-world tool usage | Success Rate38.2 | 13 | |
| Average Manipulation Performance | Real-world | Average Success Rate54.2 | 9 | |
| In-Hand Rotation | Real-world | Success Rate42.5 | 9 | |
| Pouring | Real-world | Success Rate45.3 | 9 | |
| Stacking | Real-world | Success Rate65.1 | 9 | |
| pick place | DexSim2Real Six-Task Benchmark | Success Rate (Sim)94.1 | 3 | |
| Pouring | DexSim2Real Six-Task Benchmark | Sim Performance62.5 | 3 | |
| Average (Six-Task) | DexSim2Real Six-Task | Sim Performance73.4 | 3 |