Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Re-rank Coarse Classification with Local Region Enhanced Features for Fine-Grained Image Recognition

About

Fine-grained image recognition is very challenging due to the difficulty of capturing both semantic global features and discriminative local features. Meanwhile, these two features are not easy to be integrated, which are even conflicting when used simultaneously. In this paper, a retrieval-based coarse-to-fine framework is proposed, where we re-rank the TopN classification results by using the local region enhanced embedding features to improve the Top1 accuracy (based on the observation that the correct category usually resides in TopN results). To obtain the discriminative regions for distinguishing the fine-grained images, we introduce a weakly-supervised method to train a box generating branch with only image-level labels. In addition, to learn more effective semantic global features, we design a multi-level loss over an automatically constructed hierarchical category structure. Experimental results show that our method achieves state-of-the-art performance on three benchmarks: CUB-200-2011, Stanford Cars, and FGVC Aircraft. Also, visualizations and analysis are provided for better understanding.

Shaokang Yang, Shuai Liu, Cheng Yang, Changhu Wang• 2021

Related benchmarks

TaskDatasetResultRank
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy91.1
536
Fine-grained Image ClassificationCUB-200 2011
Accuracy91.1
222
Fine-grained Image ClassificationStanford Cars
Accuracy95.49
206
Fine-grained visual classificationFGVC Aircraft
Top-1 Accuracy94.1
41
Showing 4 of 4 rows

Other info

Follow for update