Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules
About
A key challenge in leveraging data augmentation for neural network training is choosing an effective augmentation policy from a large search space of candidate operations. Properly chosen augmentation policies can lead to significant generalization improvements; however, state-of-the-art approaches such as AutoAugment are computationally infeasible to run for the ordinary user. In this paper, we introduce a new data augmentation algorithm, Population Based Augmentation (PBA), which generates nonstationary augmentation policy schedules instead of a fixed augmentation policy. We show that PBA can match the performance of AutoAugment on CIFAR-10, CIFAR-100, and SVHN, with three orders of magnitude less overall compute. On CIFAR-10 we achieve a mean test error of 1.46%, which is a slight improvement upon the current state-of-the-art. The code for PBA is open source and is available at https://github.com/arcelien/pba.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CIFAR-100 (test) | Accuracy84.7 | 3518 | |
| Image Classification | CIFAR-10 (test) | Accuracy98.5 | 3381 | |
| Image Classification | CIFAR-10 (test) | -- | 906 | |
| Image Classification | SVHN (test) | Accuracy98.9 | 362 | |
| Image Classification | CIFAR100 (test) | -- | 147 | |
| Image Classification | CIFAR-10 (test) | Error Rate2.6 | 102 | |
| Image Classification | CIFAR10 (test) | Error Rate2 | 80 | |
| Image Classification | CIFAR-10 (test) | Error Rate (%)1.5 | 53 | |
| Image Classification | CIFAR-100 (test) | Test Error Rate10.9 | 15 | |
| Data augmentation policy search | CIFAR-10 | GPU Hours5 | 9 |