
Attentive CutMix: An Enhanced Data Augmentation Approach for Deep Learning Based Image Classification

About

Convolutional neural networks (CNNs) are capable of learning robust representations with different regularization methods and activations because convolutional layers are spatially correlated. Based on this property, a large variety of regional dropout strategies have been proposed, such as Cutout, DropBlock, and CutMix. These methods aim to help the network generalize better by partially occluding the discriminative parts of objects. However, all of them perform this operation randomly, without capturing the most important region(s) within an object. In this paper, we propose Attentive CutMix, a naturally enhanced augmentation strategy based on CutMix. In each training iteration, we choose the most descriptive regions based on the intermediate attention maps from a feature extractor, which enables searching for the most discriminative parts in an image. Our proposed method is simple yet effective, easy to implement, and can boost the baseline significantly. Extensive experiments on the CIFAR-10/100 and ImageNet datasets with various CNN architectures (in a unified setting) demonstrate the effectiveness of our proposed method, which consistently outperforms the baseline CutMix and other methods by a significant margin.
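
The selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the attention map is already available as a small grid of scores (how the feature extractor produces it is out of scope here), that images are single-channel nested lists whose sides are divisible by the grid size, and that the source patches are pasted at the same positions in the target image. The names `top_k_cells`, `attentive_cutmix`, and the parameter `k` are hypothetical.

```python
from typing import List, Tuple

Grid = List[List[float]]


def top_k_cells(attention: Grid, k: int) -> List[Tuple[int, int]]:
    """Return the (row, col) indices of the k highest-scoring attention cells."""
    cells = [(r, c) for r in range(len(attention)) for c in range(len(attention[0]))]
    cells.sort(key=lambda rc: attention[rc[0]][rc[1]], reverse=True)
    return cells[:k]


def attentive_cutmix(source: Grid, target: Grid, attention: Grid, k: int) -> Tuple[Grid, float]:
    """Paste the k most-attended patches of `source` onto a copy of `target`.

    Assumption: image height and width are divisible by the attention grid size,
    so every grid cell maps to an equally sized pixel patch.
    Returns the mixed image and the fraction of pixels taken from `source`.
    """
    height, width = len(source), len(source[0])
    rows, cols = len(attention), len(attention[0])
    cell_h, cell_w = height // rows, width // cols

    mixed = [row[:] for row in target]  # copy so the target is left unchanged
    for r, c in top_k_cells(attention, k):
        for y in range(r * cell_h, (r + 1) * cell_h):
            for x in range(c * cell_w, (c + 1) * cell_w):
                mixed[y][x] = source[y][x]

    source_weight = k * cell_h * cell_w / (height * width)
    return mixed, source_weight


# Toy example: a 4x4 source of ones, a 4x4 target of zeros, a 2x2 attention grid.
source = [[1.0] * 4 for _ in range(4)]
target = [[0.0] * 4 for _ in range(4)]
attention = [[0.1, 0.9], [0.3, 0.2]]
mixed, source_weight = attentive_cutmix(source, target, attention, k=1)
```

Following the usual CutMix convention, the training label would then be mixed in the same proportion, that is `source_weight` for the source label and `1 - source_weight` for the target label. That weighting is an assumption carried over from CutMix rather than something stated in the abstract.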

Devesh Walawalkar, Zhiqiang Shen, Zechun Liu, Marios Savvides • 2020

Related benchmarks

Task                                Dataset               Result                                  Rank
Image Classification                ImageNet-1k (val)     Top-1 Accuracy: 78.19                   1453
Fine-grained Image Classification   Stanford Cars (test)  Accuracy: 90.29                         348
Image Classification                Stanford Cars (test)  --                                      306
Fine-grained visual classification  FGVC-Aircraft (test)  Top-1 Acc: 76.06                        287
Fine-grained Image Classification   CUB-200 (test)        Accuracy: 87.47                         45
Image Classification                CIFAR100 (test)       Top-1 Accuracy (0% Corruption): 80.27   32