Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

See Better Before Looking Closer: Weakly Supervised Data Augmentation Network for Fine-Grained Visual Classification

About

Data augmentation is usually adopted to increase the amount of training data, prevent overfitting and improve the performance of deep models. However, in practice, random data augmentation, such as random image cropping, is low-efficiency and might introduce many uncontrolled background noises. In this paper, we propose Weakly Supervised Data Augmentation Network (WS-DAN) to explore the potential of data augmentation. Specifically, for each training image, we first generate attention maps to represent the object's discriminative parts by weakly supervised learning. Next, we augment the image guided by these attention maps, including attention cropping and attention dropping. The proposed WS-DAN improves the classification accuracy in two folds. In the first stage, images can be seen better since more discriminative parts' features will be extracted. In the second stage, attention regions provide accurate location of object, which ensures our model to look at the object closer and further improve the performance. Comprehensive experiments in common fine-grained visual classification datasets show that our WS-DAN surpasses the state-of-the-art methods, which demonstrates its effectiveness.

Tao Hu, Honggang Qi, Qingming Huang, Yan Lu• 2019

Related benchmarks

TaskDatasetResultRank
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy89.4
536
Fine-grained Image ClassificationStanford Cars (test)
Accuracy94.5
348
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc93
287
Fine-grained Image ClassificationCUB-200 2011
Accuracy89.4
222
Fine-grained Image ClassificationStanford Cars
Accuracy94.5
206
Fine-grained Image ClassificationStanford Dogs (test)
Accuracy92.2
117
Image ClassificationStanford Dogs (test)
Top-1 Acc92.2
85
Object LocalizationCUB-200-2011 (test)--
68
Fine-grained Visual CategorizationStanford Dogs
Accuracy92.2
51
Fine-grained visual classificationFGVC Aircraft
Top-1 Accuracy93
41
Showing 10 of 16 rows

Other info

Follow for update