Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
About
We propose a simple data augmentation technique that can be applied to standard model-free reinforcement learning algorithms, enabling robust learning directly from pixels without the need for auxiliary losses or pre-training. The approach leverages input perturbations commonly used in computer vision tasks to regularize the value function. Existing model-free approaches, such as Soft Actor-Critic (SAC), are not able to train deep networks effectively from image pixels. However, the addition of our augmentation method dramatically improves SAC's performance, enabling it to reach state-of-the-art performance on the DeepMind control suite, surpassing model-based (Dreamer, PlaNet, and SLAC) methods and recently proposed contrastive learning (CURL). Our approach can be combined with any model-free reinforcement learning algorithm, requiring only minor modifications. An implementation can be found at https://sites.google.com/view/data-regularized-q.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Continuous Control | DMControl 500k | Spin Score938 | 33 | |
| Control | DMControl | DMControl: Ball in Cup Catch Score914.9 | 29 | |
| Continuous Control | DMControl 100k | DMControl: Finger Spin Score901 | 29 | |
| Reinforcement Learning | Atari100k (test) | Alien Score771.2 | 23 | |
| Reinforcement Learning | Atari 100k steps (test) | Median HNS0.268 | 20 | |
| Continuous Control | DMC-GB video hard | Cartpole Swingup Score1.36e+4 | 18 | |
| Reinforcement Learning | Atari 100k | Alien Score734.1 | 18 | |
| Visual Reinforcement Learning | DMControl Finger, Spin | Episode Return901 | 16 | |
| Visual Reinforcement Learning | DMControl Walker Walk | Episode Return612 | 16 | |
| Visual Reinforcement Learning | DMControl Ball in cup, Catch | Episode Return913 | 16 |