Classification-Specific Parts for Improving Fine-Grained Visual Categorization
About
Fine-grained visual categorization is a classification task for distinguishing categories with high intra-class and small inter-class variance. While global approaches aim at using the whole image for performing the classification, part-based solutions gather additional local information in terms of attentions or parts. We propose a novel classification-specific part estimation that uses an initial prediction as well as back-propagation of feature importance via gradient computations in order to estimate relevant image regions. The subsequently detected parts are then not only selected by a-posteriori classification knowledge, but also have an intrinsic spatial extent that is determined automatically. This is in contrast to most part-based approaches and even to available ground-truth part annotations, which only provide point coordinates and no additional scale information. We show in our experiments on various widely-used fine-grained datasets the effectiveness of the mentioned part selection method in conjunction with the extracted part features.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Fine-grained Image Classification | CUB200 2011 (test) | Accuracy90.9 | 536 | |
| Fine-grained Image Classification | Stanford Cars (test) | Accuracy92.5 | 348 | |
| Fine-grained Image Classification | CUB-200 2011 | Accuracy89.5 | 222 | |
| Fine-grained Image Classification | Stanford Cars | Accuracy92.5 | 206 | |
| Fine-grained visual classification | NABirds (test) | Top-1 Accuracy89.9 | 157 | |
| Fine-grained Image Classification | Stanford Dogs (test) | Accuracy77.8 | 117 | |
| Image Classification | Birdsnap (test) | Top-1 Acc84 | 44 | |
| Fine grained classification | Flower102 (test) | Accuracy96.9 | 27 | |
| Fine-grained Image Classification | EU-Moths (test) | Accuracy91 | 4 |