Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pairwise Confusion for Fine-Grained Visual Classification

About

Fine-Grained Visual Classification (FGVC) datasets contain small sample sizes, along with significant intra-class variation and inter-class similarity. While prior work has addressed intra-class variation using localization and segmentation techniques, inter-class similarity may also affect feature learning and reduce classification performance. In this work, we address this problem using a novel optimization procedure for the end-to-end neural network training on FGVC tasks. Our procedure, called Pairwise Confusion (PC) reduces overfitting by intentionally {introducing confusion} in the activations. With PC regularization, we obtain state-of-the-art performance on six of the most widely-used FGVC datasets and demonstrate improved localization ability. {PC} is easy to implement, does not need excessive hyperparameter tuning during training, and does not add significant overhead during test time.

Abhimanyu Dubey, Otkrist Gupta, Pei Guo, Ramesh Raskar, Ryan Farrell, Nikhil Naik• 2017

Related benchmarks

TaskDatasetResultRank
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy86.9
543
Fine-grained Image ClassificationStanford Cars (test)
Accuracy93.43
348
Image ClassificationStanford Cars (test)
Accuracy93.43
316
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc89.8
312
Image ClassificationFGVC-Aircraft (test)
Accuracy89.2
305
Fine-grained Image ClassificationCUB-200 2011
Accuracy87.7
300
Fine-grained Image ClassificationStanford Cars
Accuracy94.3
284
Fine-grained visual classificationNABirds (test)
Top-1 Accuracy82.8
157
Fine-grained Image ClassificationStanford Dogs (test)
Accuracy83.8
124
Fine-grained Visual CategorizationStanford Cars (test)
Accuracy92.9
114
Showing 10 of 25 rows

Other info

Code

Follow for update