Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LOCORE: Image Re-ranking with Long-Context Sequence Modeling

About

We introduce LOCORE, Long-Context Re-ranker, a model that takes as input local descriptors corresponding to an image query and a list of gallery images and outputs similarity scores between the query and each gallery image. This model is used for image retrieval, where typically a first ranking is performed with an efficient similarity measure, and then a shortlist of top-ranked images is re-ranked based on a more fine-grained similarity measure. Compared to existing methods that perform pair-wise similarity estimation with local descriptors or list-wise re-ranking with global descriptors, LOCORE is the first method to perform list-wise re-ranking with local descriptors. To achieve this, we leverage efficient long-context sequence models to effectively capture the dependencies between query and gallery images at the local-descriptor level. During testing, we process long shortlists with a sliding window strategy that is tailored to overcome the context size limitations of sequence models. Our approach achieves superior performance compared with other re-rankers on established image retrieval benchmarks of landmarks (ROxf and RPar), products (SOP), fashion items (In-Shop), and bird species (CUB-200) while having comparable latency to the pair-wise local descriptor re-rankers.

Zilin Xiao, Pavel Suma, Ayush Sachdeva, Hao-Jen Wang, Giorgos Kordopatis-Zilos, Giorgos Tolias, Vicente Ordonez• 2025

Related benchmarks

TaskDatasetResultRank
Image RetrievalRevisited Oxford (ROxf) + R1M (Medium)
mAP86.5
95
Image RetrievalRevisited Oxford (ROxf) + R1M (Hard)
mAP76.3
83
Image RetrievalRevisited Paris (RPar) + R1M (Hard)
mAP78.2
82
Image RetrievalRevisited Paris (RPar) + R1M (Medium)
mAP86.8
74
Image RetrievalSOP (test)
Recall@183.8
42
Image RetrievalIn-Shop (test)
Recall@189.4
38
Image RetrievalRPar+R1M Medium
mAP87.2
31
Image RetrievalRPar+R1M Hard
mAP76.9
31
Image RetrievalRevisited Oxford (ROxf) v1 (Medium)
mAP92
29
Image RetrievalRevisited Paris (RPar) Medium v1
mAP93.8
29
Showing 10 of 15 rows

Other info

Code

Follow for update