Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

On Train-Test Class Overlap and Detection for Image Retrieval

About

How important is it for training and evaluation sets to not have class overlap in image retrieval? We revisit Google Landmarks v2 clean, the most popular training set, by identifying and removing class overlap with Revisited Oxford and Paris [34], the most popular evaluation set. By comparing the original and the new RGLDv2-clean on a benchmark of reproduced state-of-the-art methods, our findings are striking. Not only is there a dramatic drop in performance, but it is inconsistent across methods, changing the ranking.What does it take to focus on objects or interest and ignore background clutter when indexing? Do we need to train an object detector and the representation separately? Do we need location supervision? We introduce Single-stage Detect-to-Retrieve (CiDeR), an end-to-end, single-stage pipeline to detect objects of interest and extract a global image representation. We outperform previous state-of-the-art on both existing training sets and the new RGLDv2-clean. Our dataset is available at https://github.com/dealicious-inc/RGLDv2-clean.

Chull Hwan Song, Jooyoung Yoon, Taebaek Hwang, Shunghyun Choi, Yeong Hyeon Gu, Yannis Avrithis• 2024

Related benchmarks

TaskDatasetResultRank
Image RetrievalRevisited Oxford (ROxf) (Medium)
mAP77.8
124
Image RetrievalRevisited Paris (RPar) (Hard)
mAP75.3
115
Image RetrievalOxford 5k
mAP92.6
100
Image RetrievalRevisited Paris (RPar) (Medium)
mAP84.5
100
Image RetrievalRevisited Oxford (ROxf) (Hard)
mAP61.9
81
Image RetrievalParis Revisited (Medium)
mAP87.4
63
Image RetrievalParis6k
mAP96.1
45
Image RetrievalOxford Revisited (Hard)
mAP58.9
33
Image RetrievalRPar+R1M Medium
mAP61.6
31
Image RetrievalRPar+R1M Hard
mAP35.8
31
Showing 10 of 14 rows

Other info

Code

Follow for update