Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unifying Deep Local and Global Features for Image Search

About

Image retrieval is the problem of searching an image database for items that are similar to a query image. To address this task, two main types of image representations have been studied: global and local image features. In this work, our key contribution is to unify global and local features into a single deep model, enabling accurate retrieval with efficient feature extraction. We refer to the new model as DELG, standing for DEep Local and Global features. We leverage lessons from recent feature learning work and propose a model that combines generalized mean pooling for global features and attentive selection for local features. The entire network can be learned end-to-end by carefully balancing the gradient flow between two heads -- requiring only image-level labels. We also introduce an autoencoder-based dimensionality reduction technique for local features, which is integrated into the model, improving training efficiency and matching performance. Comprehensive experiments show that our model achieves state-of-the-art image retrieval on the Revisited Oxford and Paris datasets, and state-of-the-art single-model instance-level recognition on the Google Landmarks dataset v2. Code and models are available at https://github.com/tensorflow/models/tree/master/research/delf .

Bingyi Cao, Andre Araujo, Jack Sim• 2020

Related benchmarks

TaskDatasetResultRank
Visual Place RecognitionMSLS (val)
Recall@183.9
236
Visual Place RecognitionPitts30k
Recall@189.9
164
Visual Place RecognitionTokyo24/7
Recall@195.9
146
Visual Place RecognitionMSLS Challenge
Recall@156.5
134
Image RetrievalRevisited Oxford (ROxf) (Medium)
mAP81.2
124
Image RetrievalRevisited Paris (RPar) (Hard)
mAP72.8
115
Image RetrievalRevisited Paris (RPar) (Medium)
mAP87.2
100
Image RetrievalRevisited Oxford (ROxf) + R1M (Medium)
mAP69.1
95
Visual Place RecognitionPittsburgh30k (test)
Recall@189.9
86
Image RetrievalRevisited Oxford (ROxf) + R1M (Hard)
mAP47.5
83
Showing 10 of 36 rows

Other info

Follow for update