Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
About
Transferring the knowledge learned from large scale datasets (e.g., ImageNet) via fine-tuning offers an effective solution for domain-specific fine-grained visual categorization (FGVC) tasks (e.g., recognizing bird species or car make and model). In such scenarios, data annotation often calls for specialized domain knowledge and thus is difficult to scale. In this work, we first tackle a problem in large scale FGVC. Our method won first place in iNaturalist 2017 large scale species classification challenge. Central to the success of our approach is a training scheme that uses higher image resolution and deals with the long-tailed distribution of training data. Next, we study transfer learning via fine-tuning from large scale datasets to small scale, domain-specific FGVC datasets. We propose a measure to estimate domain similarity via Earth Mover's Distance and demonstrate that transfer learning benefits from pre-training on a source domain that is similar to the target domain by this measure. Our proposed transfer learning outperforms ImageNet pre-training and obtains state-of-the-art results on multiple commonly used FGVC datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Fine-grained Image Classification | CUB200 2011 (test) | Accuracy89.6 | 536 | |
| Image Classification | Food-101 | Accuracy90.4 | 494 | |
| Fine-grained Image Classification | Stanford Cars (test) | Accuracy93.5 | 348 | |
| Fine-grained Image Classification | CUB-200 2011 | Accuracy89.6 | 222 | |
| Fine-grained Image Classification | Stanford Cars | Accuracy93.5 | 206 | |
| Image Classification | CIFAR-10-LT (test) | Top-1 Error0.1678 | 185 | |
| Fine-grained visual classification | NABirds (test) | Top-1 Accuracy87.9 | 157 | |
| Image Classification | CIFAR-100 Long-Tailed (test) | -- | 149 | |
| Image Classification | Oxford Flowers-102 (test) | Top-1 Accuracy97.7 | 131 | |
| Fine-grained Image Classification | Stanford Dogs (test) | Accuracy78.5 | 117 |