Maximum-Entropy Fine-Grained Classification
About
Fine-Grained Visual Classification (FGVC) is an important computer vision problem that involves small diversity within the different classes, and often requires expert annotators to collect data. Utilizing this notion of small visual diversity, we revisit Maximum-Entropy learning in the context of fine-grained classification, and provide a training routine that maximizes the entropy of the output probability distribution for training convolutional neural networks on FGVC tasks. We provide a theoretical as well as empirical justification of our approach, and achieve state-of-the-art performance across a variety of classification tasks in FGVC, that can potentially be extended to any fine-tuning task. Our method is robust to different hyperparameter values, amount of training data and amount of training label noise and can hence be a valuable tool in many similar problems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Fine-grained Image Classification | CUB200 2011 (test) | Accuracy86.54 | 536 | |
| Fine-grained Image Classification | Stanford Cars (test) | Accuracy93 | 348 | |
| Image Classification | Stanford Cars (test) | Accuracy93.9 | 306 | |
| Fine-grained visual classification | FGVC-Aircraft (test) | Top-1 Acc89.76 | 287 | |
| Image Classification | CUB-200-2011 (test) | Top-1 Acc86.6 | 276 | |
| Image Classification | FGVC-Aircraft (test) | Accuracy89.8 | 231 | |
| Fine-grained Image Classification | CUB-200 2011 | Accuracy86.6 | 222 | |
| Fine-grained Image Classification | Stanford Cars | Accuracy93.9 | 206 | |
| Fine-grained visual classification | NABirds (test) | Top-1 Accuracy83 | 157 | |
| Fine-grained Image Classification | Stanford Dogs (test) | Accuracy83.6 | 117 |