Graph-CoVis: GNN-based Multi-view Panorama Global Pose Estimation

About

In this paper, we address the problem of wide-baseline camera pose estimation from a group of 360$^\circ$ panoramas under upright-camera assumption. Recent work has demonstrated the merit of deep-learning for end-to-end direct relative pose regression in 360$^\circ$ panorama pairs [11]. To exploit the benefits of multi-view logic in a learning-based framework, we introduce Graph-CoVis, which non-trivially extends CoVisPose [11] from relative two-view to global multi-view spherical camera pose estimation. Graph-CoVis is a novel Graph Neural Network based architecture that jointly learns the co-visible structure and global motion in an end-to-end and fully-supervised approach. Using the ZInD [4] dataset, which features real homes presenting wide-baselines, occlusion, and limited visual overlap, we show that our model performs competitively to state-of-the-art approaches.

Negar Nejatishahidin, Will Hutchcroft, Manjunath Narayana, Ivaylo Boyadzhiev, Yuguang Li, Naji Khosravan, Jana Kosecka, Sing Bing Kang• 2023

Related benchmarks

Task	Dataset	Result
Global Pose Estimation	ZInD	Rot Error Mean (deg)2	6
Camera Pose and Layout Estimation	ZInD 0.6 imgs/rm	Pose Rotation Error Mean0.24	4
Camera Pose and Layout Estimation	ZInD 1 imgs/rm	Pose Rotation Mean Error0.26	4
Camera Pose and Layout Estimation	ZInD 2 imgs/rm	Rotation Error Mean (°)0.26	4
Camera Pose and Layout Estimation	ZInD 3 imgs/rm	Rotation Error Mean (°)0.23	4

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord