Visual Instance Retrieval with Deep Convolutional Networks
About
This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and spatial consistency. In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately.
Ali Sharif Razavian, Josephine Sullivan, Stefan Carlsson, Atsuto Maki• 2014
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Retrieval | Holidays | mAP71.6 | 115 | |
| Image Retrieval | Oxford5k (test) | mAP56.4 | 97 | |
| Image Retrieval | Paris6k (test) | mAP94.9 | 88 | |
| Image Retrieval | Oxford105k (test) | mAP85.7 | 56 | |
| Image Retrieval | Paris106k (test) | mAP58 | 26 | |
| Image Retrieval | Holidays standard (test) | mAP79 | 25 | |
| Image Retrieval | Oxford5k original (test) | mAP88.1 | 18 | |
| Image Retrieval | Paris106k large-scale (test) | mAP91.3 | 18 | |
| Image Retrieval | Oxford5K full query | mAP53.3 | 8 | |
| Image Retrieval | Holidays 101k (test) | mAP66.1 | 5 |
Showing 10 of 11 rows