Maximum-Entropy Fine-Grained Classification

About

Fine-Grained Visual Classification (FGVC) is an important computer vision problem that involves small diversity within the different classes, and often requires expert annotators to collect data. Utilizing this notion of small visual diversity, we revisit Maximum-Entropy learning in the context of fine-grained classification, and provide a training routine that maximizes the entropy of the output probability distribution for training convolutional neural networks on FGVC tasks. We provide a theoretical as well as empirical justification of our approach, and achieve state-of-the-art performance across a variety of classification tasks in FGVC, that can potentially be extended to any fine-tuning task. Our method is robust to different hyperparameter values, amount of training data and amount of training label noise and can hence be a valuable tool in many similar problems.

Abhimanyu Dubey, Otkrist Gupta, Ramesh Raskar, Nikhil Naik• 2018

Related benchmarks

Task	Dataset	Result
Fine-grained Image Classification	CUB200 2011 (test)	Accuracy86.54	585
Fine-grained Image Classification	Stanford Cars (test)	Accuracy93	372
Image Classification	FGVC-Aircraft (test)	Accuracy89.8	322
Image Classification	Stanford Cars (test)	Accuracy93.9	320
Fine-grained Image Classification	CUB-200 2011	Accuracy86.6	317
Image Classification	CUB-200-2011 (test)	Top-1 Acc86.6	316
Fine-grained visual classification	FGVC-Aircraft (test)	Top-1 Acc89.76	312
Fine-grained Image Classification	Stanford Cars	Accuracy93.9	298
Fine-grained visual classification	NABirds (test)	Top-1 Accuracy83	157
Image Classification	Stanford Dogs (test)	Top-1 Acc84.9	140

Showing 10 of 22 rows

Other info

Follow for update

@wizwand_team Discord