FS-I2P:A Hierarchical Focus-Sweep Registration Network with Dynamically Allocated Depth

About

Image-to-point cloud registration is often challenged by viewpoint changes, cross-modal discrepancies, and repetitive textures, which induce scale ambiguity and consequently lead to erroneous correspondences. Recent detection-free methods alleviate this issue by leveraging multi-scale features and transformer-based interactions. However, they still suffer from attention drift across layers and intra-scale inconsistencies, hindering precise registration. Inspired by human behavior, we propose a ``Focus--Sweep'' paradigm and develop a Hierarchical Focus--Sweep Interaction Module within an SSM-based framework to enhance multi-level cross-modal feature association. In addition, we introduce a Dynamic Layer Allocation Strategy that adaptively determines the iteration depth to better exploit geometric constraints and improve matching robustness. Extensive experiments and ablations on two benchmarks, RGB-D Scenes V2 and 7-Scenes, demonstrate that our approach achieves state-of-the-art performance.

Zhixin Cheng, Yujia Chen, Xujing Tao, Bohao Liao, Xiaotian Yin, Baoqun Yin, Tianzhu Zhang• 2026

Related benchmarks

Task	Dataset	Result
2D/3D Registration	RGB-D Scenes v2	--	53
2D/3D Registration	7 Scenes	Registration Recall (Mean)84.6	25
Image-to-point cloud registration	KITTI	RTE (m)1.57	10

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord