How to Choose Your Teacher for Fine Grained Image Recognition
About
Fine-grained image recognition classifies subcategories such as bird species or car models. While state-of-the-art (SOTA) models are accurate, they are often too resource-intensive for deployment on constrained devices. Knowledge distillation addresses this by transferring knowledge from a large teacher model to a smaller student model. A key challenge is selecting the right teacher, as it heavily impacts student performance. This paper introduces a teacher selection metric, \textbf{Ratio 1-2}, based on teacher prediction ratios. Extensive analysis of over one thousand experiments across 3 students, 8 teachers, and 8 datasets under 4 training strategies demonstrates that our metric improves teacher selection by 18\% over previous methods, enabling small student models to achieve up to 17\% accuracy gains. Experiment codebase is available at: \href{https://github.com/arkel23/FGIR-KD-Teacher}{https://github.com/arkel23/FGIR-KD-Teacher}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Classification | Cars | Accuracy82.5 | 492 | |
| Image Classification | CUB | Accuracy73.5 | 331 | |
| Image Classification | Flowers | Accuracy88.6 | 135 | |
| Image Classification | Dogs | Accuracy68 | 72 | |
| Image Classification | NABirds | Accuracy67.8 | 63 | |
| Image Classification | Aircraft | Accuracy85.2 | 58 | |
| Image Classification | Moe | Accuracy95.2 | 4 |