Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Cross-X Learning for Fine-Grained Visual Categorization

About

Recognizing objects from subcategories with very subtle differences remains a challenging task due to the large intra-class and small inter-class variation. Recent work tackles this problem in a weakly-supervised manner: object parts are first detected and the corresponding part-specific features are extracted for fine-grained classification. However, these methods typically treat the part-specific features of each image in isolation while neglecting their relationships between different images. In this paper, we propose Cross-X learning, a simple yet effective approach that exploits the relationships between different images and between different network layers for robust multi-scale feature learning. Our approach involves two novel components: (i) a cross-category cross-semantic regularizer that guides the extracted features to represent semantic parts and, (ii) a cross-layer regularizer that improves the robustness of multi-scale features by matching the prediction distribution across multiple layers. Our approach can be easily trained end-to-end and is scalable to large datasets like NABirds. We empirically analyze the contributions of different components of our approach and demonstrate its robustness, effectiveness and state-of-the-art performance on five benchmark datasets. Code is available at \url{https://github.com/cswluo/CrossX}.

Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim• 2019

Related benchmarks

TaskDatasetResultRank
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy87.7
536
Image ClassificationStanford Cars (test)
Accuracy94.6
306
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc92.6
287
Image ClassificationCUB-200-2011 (test)
Top-1 Acc87.7
276
Image ClassificationFGVC-Aircraft (test)
Accuracy92.7
231
Fine-grained Image ClassificationCUB-200 2011
Accuracy87.7
222
Fine-grained Image ClassificationStanford Cars
Accuracy94.6
206
Fine-grained visual classificationNABirds (test)
Top-1 Accuracy86.4
157
Fine-grained Image ClassificationStanford Dogs (test)
Accuracy88.9
117
Fine-grained Visual CategorizationStanford Cars (test)
Accuracy94.6
110
Showing 10 of 19 rows

Other info

Code

Follow for update