Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Grafit: Learning fine-grained image representations with coarse labels

About

This paper tackles the problem of learning a finer representation than the one provided by training labels. This enables fine-grained category retrieval of images in a collection annotated with coarse labels only. Our network is learned with a nearest-neighbor classifier objective, and an instance loss inspired by self-supervised learning. By jointly leveraging the coarse labels and the underlying fine-grained latent space, it significantly improves the accuracy of category-level retrieval methods. Our strategy outperforms all competing methods for retrieving or classifying images at a finer granularity than that available at train time. It also improves the accuracy for transfer learning tasks to fine-grained datasets, thereby establishing the new state of the art on five public benchmarks, like iNaturalist-2018.

Hugo Touvron, Alexandre Sablayrolles, Matthijs Douze, Matthieu Cord, Herv\'e J\'egou• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)
Top-1 Accuracy79.6
1453
Image ClassificationImageNet-1K
Top-1 Acc79.6
524
Image ClassificationStanford Cars
Accuracy94.7
477
Image ClassificationImageNet
Top-1 Accuracy79.6
429
Fine-grained Image ClassificationStanford Cars (test)
Accuracy94.7
348
Image ClassificationStanford Cars (test)
Accuracy92.5
306
Image ClassificationiNaturalist 2018
Top-1 Accuracy81.2
287
Image ClassificationOxford Flowers 102
Accuracy99
172
Image ClassificationFlowers-102
Top-1 Acc99.1
141
Image ClassificationOxford Flowers-102 (test)
Top-1 Accuracy99.1
131
Showing 10 of 35 rows

Other info

Follow for update