Robust Scene Change Detection Using Visual Foundation Models and Cross-Attention Mechanisms

About

We present a novel method for scene change detection that leverages the robust feature extraction capabilities of a visual foundational model, DINOv2, and integrates full-image cross-attention to address key challenges such as varying lighting, seasonal variations, and viewpoint differences. In order to effectively learn correspondences and mis-correspondences between an image pair for the change detection task, we propose to a) ``freeze'' the backbone in order to retain the generality of dense foundation features, and b) employ ``full-image'' cross-attention to better tackle the viewpoint variations between the image pair. We evaluate our approach on two benchmark datasets, VL-CMU-CD and PSCD, along with their viewpoint-varied versions. Our experiments demonstrate significant improvements in F1-score, particularly in scenarios involving geometric changes between image pairs. The results indicate our method's superior generalization capabilities over existing state-of-the-art approaches, showing robustness against photometric and geometric variations as well as better overall generalization when fine-tuned to adapt to new environments. Detailed ablation studies further validate the contributions of each component in our architecture. Our source code is available at: https://github.com/ChadLin9596/Robust-Scene-Change-Detection.

Chun-Jung Lin, Sourav Garg, Tat-Jun Chin, Feras Dayoub• 2024

Related benchmarks

Task	Dataset	Result
Semantic Change Detection	TSUNAMI	--	25
Scene Change Detection	VL-CMU-CD	F1 Score79.5	17
Scene Change Detection	PASLCD (test)	mIoU11.8	14
Change Detection	VL-CMU-CD 1 (test)	Aligned F176	10
Scene Change Detection	PSCD	S64.5	10
Scene Change Detection	ChangeSim	S Score71.8	10
Scene Change Detection	VL-CMU-CD Aligned	F1 Score79.5	7
Scene Change Detection	VL-CMU-CD Diff-2	F1 Score73.9	6
Binary Change Detection	VL-CMU-CD	F1 Score83	6
Binary Change Detection	PSCD	F1 Score54	6

Showing 10 of 14 rows

Other info

Code

Follow for update

@wizwand_team Discord