Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition

About

Recent studies show that vision models pre-trained in generic visual learning tasks with large-scale data can provide useful feature representations for a wide range of visual perception problems. However, few attempts have been made to exploit pre-trained foundation models in visual place recognition (VPR). Due to the inherent difference in training objectives and data between the tasks of model pre-training and VPR, how to bridge the gap and fully unleash the capability of pre-trained models for VPR is still a key issue to address. To this end, we propose a novel method to realize seamless adaptation of pre-trained models for VPR. Specifically, to obtain both global and local features that focus on salient landmarks for discriminating places, we design a hybrid adaptation method to achieve both global and local adaptation efficiently, in which only lightweight adapters are tuned without adjusting the pre-trained model. Besides, to guide effective adaptation, we propose a mutual nearest neighbor local feature loss, which ensures proper dense local features are produced for local matching and avoids time-consuming spatial verification in re-ranking. Experimental results show that our method outperforms the state-of-the-art methods with less training data and training time, and uses about only 3% retrieval runtime of the two-stage VPR methods with RANSAC-based spatial verification. It ranks 1st on the MSLS challenge leaderboard (at the time of submission). The code is released at https://github.com/Lu-Feng/SelaVPR.

Feng Lu, Lijun Zhang, Xiangyuan Lan, Shuting Dong, Yaowei Wang, Chun Yuan• 2024

Related benchmarks

TaskDatasetResultRank
Visual Place RecognitionMSLS (val)
Recall@190.8
236
Visual Place RecognitionPitts30k
Recall@192.8
164
Visual Place RecognitionTokyo24/7
Recall@194
146
Visual Place RecognitionMSLS Challenge
Recall@173.5
134
Visual Place RecognitionNordland
Recall@187.3
112
Visual Place RecognitionSPED
Recall@189.5
106
Visual Place RecognitionPittsburgh30k (test)
Recall@192.8
86
Visual Place RecognitionAmsterTime
Recall@155.2
83
Visual Place RecognitionSt Lucia
R@199.8
76
Visual Place RecognitionNordland
Recall@185.2
72
Showing 10 of 25 rows

Other info

Code

Follow for update