Learning with Average Precision: Training Image Retrieval with a Listwise Loss
About
Image retrieval can be formulated as a ranking problem where the goal is to order database images by decreasing similarity to the query. Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain. First, rather than directly optimizing the global ranking, they minimize an upper-bound on the essential loss, which does not necessarily result in an optimal mean average precision (mAP). Second, these methods require significant engineering efforts to work well, e.g. special pre-training and hard-negative mining. In this paper we propose instead to directly optimize the global mAP by leveraging recent advances in listwise loss formulations. Using a histogram binning approximation, the AP can be differentiated and thus employed to end-to-end learning. Compared to existing losses, the proposed method considers thousands of images simultaneously at each iteration and eliminates the need for ad hoc tricks. It also establishes a new state of the art on many standard retrieval benchmarks. Models and evaluation scripts have been made available at https://europe.naverlabs.com/Deep-Image-Retrieval/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Retrieval | CUB-200-2011 (test) | Recall@161.2 | 251 | |
| Visual Place Recognition | Pitts30k | Recall@175.3 | 164 | |
| Visual Place Recognition | Tokyo24/7 | Recall@140.3 | 146 | |
| Image Retrieval | Revisited Oxford (ROxf) (Medium) | mAP73.6 | 124 | |
| Image Retrieval | Revisited Paris (RPar) (Hard) | mAP71.5 | 115 | |
| Visual Place Recognition | Nordland | Recall@15.6 | 112 | |
| Image Retrieval | Revisited Paris (RPar) (Medium) | mAP85.7 | 100 | |
| Image Retrieval | Revisited Oxford (ROxf) + R1M (Medium) | mAP60.6 | 95 | |
| Image Retrieval | Revisited Oxford (ROxf) + R1M (Hard) | mAP32.7 | 83 | |
| Image Retrieval | Revisited Paris (RPar) + R1M (Hard) | mAP44.4 | 82 |