Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning and aggregating deep local descriptors for instance-level recognition

About

We propose an efficient method to learn deep local descriptors for instance-level recognition. The training only requires examples of positive and negative image pairs and is performed as metric learning of sum-pooled global image descriptors. At inference, the local descriptors are provided by the activations of internal components of the network. We demonstrate why such an approach learns local descriptors that work well for image similarity estimation with classical efficient match kernel methods. The experimental validation studies the trade-off between performance and memory requirements of the state-of-the-art image search approach based on match kernels. Compared to existing local descriptors, the proposed ones perform better in two instance-level recognition tasks and keep memory requirements lower. We experimentally show that global descriptors are not effective enough at large scale and that local descriptors are essential. We achieve state-of-the-art performance, in some cases even with a backbone network as small as ResNet18.

Giorgos Tolias, Tomas Jenicek, Ond\v{r}ej Chum• 2020

Related benchmarks

TaskDatasetResultRank
Image RetrievalRevisited Oxford (ROxf) (Medium)
mAP75.5
124
Image RetrievalRevisited Paris (RPar) (Medium)
mAP81.4
100
Image RetrievalParis Revisited (Medium)
mAP81.4
63
Visual LocalizationAachen Day-Night v1.1 (Night)
Success Rate (0.25m, 2°)72.8
58
Visual LocalizationAachen Day-Night v1.1 (Day)
SR (0.25m, 2°)90.8
50
Instance-level searchROxford (test)--
36
Image RetrievalTokyo 24/7 (test)
mAP89.2
34
Image RetrievalRevisited Oxford (ROxf) Hard 1.0 (test)
mAP56.9
20
Instance-level searchROxford + R1M (test)
mAP (Medium)65.8
19
Image Copy DetectionDISC 2021 (val)
µAP17.3
14
Showing 10 of 24 rows

Other info

Code

Follow for update