End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

About

Part-based approaches for fine-grained recognition do not show the expected performance gain over global methods, although they explicitly focus on small details that are relevant for distinguishing highly similar classes. We assume that part-based methods suffer from the lack of a representation of local features that is invariant to the order of parts and can appropriately handle a varying number of visible parts. The order of parts is artificial and often given only by ground-truth annotations, whereas viewpoint variations and occlusions result in parts that are not observable. Therefore, we propose integrating a Fisher vector encoding of part features into convolutional neural networks. The parameters for this encoding are estimated by an online EM algorithm jointly with those of the neural network and are more precise than the estimates of previous works. Our approach improves state-of-the-art accuracies for three bird species classification datasets.
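
To illustrate why a Fisher vector encoding suits a set of part features, the following is a minimal sketch, not the authors' implementation. It assumes a diagonal-covariance Gaussian mixture with fixed, already known parameters and uses the standard Fisher vector gradients with respect to the component means and standard deviations. The function name `fisher_vector` and all variable names are hypothetical, and the online EM estimation described in the abstract is left out.

```python
import numpy as np

def fisher_vector(parts, weights, means, sigmas):
    """Encode a set of part features (N, D) as one vector of length 2*K*D.

    Assumed inputs (illustrative, not taken from the paper's code):
    weights (K,), means (K, D), sigmas (K, D) of a diagonal GMM.
    """
    n_parts, dim = parts.shape
    # Differences to every component, scaled by its std: shape (N, K, D)
    diff = (parts[:, None, :] - means[None, :, :]) / sigmas[None, :, :]
    # log(w_k * N(x_n | mu_k, diag(sigma_k^2))): shape (N, K)
    log_prob = (
        np.log(weights)[None, :]
        - 0.5 * np.sum(diff ** 2, axis=2)
        - np.sum(np.log(sigmas), axis=1)[None, :]
        - 0.5 * dim * np.log(2.0 * np.pi)
    )
    # Soft assignments of parts to components (numerically stable softmax)
    log_prob -= log_prob.max(axis=1, keepdims=True)
    gamma = np.exp(log_prob)
    gamma /= gamma.sum(axis=1, keepdims=True)
    # Averaging over parts removes any dependence on part order and count
    g_mu = np.einsum("nk,nkd->kd", gamma, diff)
    g_mu /= n_parts * np.sqrt(weights)[:, None]
    g_sigma = np.einsum("nk,nkd->kd", gamma, diff ** 2 - 1.0)
    g_sigma /= n_parts * np.sqrt(2.0 * weights)[:, None]
    return np.concatenate([g_mu.ravel(), g_sigma.ravel()])

# Toy data: K=3 components, D=4 feature dimensions, 5 visible parts
rng = np.random.default_rng(0)
K, D = 3, 4
weights = np.full(K, 1.0 / K)
means = rng.normal(size=(K, D))
sigmas = np.ones((K, D))
parts_a = rng.normal(size=(5, D))

fv_a = fisher_vector(parts_a, weights, means, sigmas)
fv_perm = fisher_vector(parts_a[::-1], weights, means, sigmas)  # reordered parts
fv_b = fisher_vector(parts_a[:3], weights, means, sigmas)       # fewer visible parts
```

In this sketch, reordering the parts leaves the encoding unchanged up to floating-point rounding, and dropping parts changes the values but not the output length, which are the two properties the abstract asks of a part representation.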

Dimitri Korsch, Paul Bodesheim, Joachim Denzler • 2020

Related benchmarks

Task                                 | Dataset              | Result              | Rank
Fine-grained Image Classification    | CUB200 2011 (test)   | Accuracy 91.2       | 536
Fine-grained visual classification   | NABirds (test)       | Top-1 Accuracy 90.4 | 157
Fine-grained Image Classification    | Stanford Dogs (test) | Accuracy 79.2       | 117
Image Classification                 | Birdsnap (test)      | Top-1 Acc 85.3      | 44
Fine-grained Image Classification    | EU-Moths (test)      | Accuracy 93         | 4
