Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Graph-CoVis: GNN-based Multi-view Panorama Global Pose Estimation

About

In this paper, we address the problem of wide-baseline camera pose estimation from a group of 360$^\circ$ panoramas under upright-camera assumption. Recent work has demonstrated the merit of deep-learning for end-to-end direct relative pose regression in 360$^\circ$ panorama pairs [11]. To exploit the benefits of multi-view logic in a learning-based framework, we introduce Graph-CoVis, which non-trivially extends CoVisPose [11] from relative two-view to global multi-view spherical camera pose estimation. Graph-CoVis is a novel Graph Neural Network based architecture that jointly learns the co-visible structure and global motion in an end-to-end and fully-supervised approach. Using the ZInD [4] dataset, which features real homes presenting wide-baselines, occlusion, and limited visual overlap, we show that our model performs competitively to state-of-the-art approaches.

Negar Nejatishahidin, Will Hutchcroft, Manjunath Narayana, Ivaylo Boyadzhiev, Yuguang Li, Naji Khosravan, Jana Kosecka, Sing Bing Kang• 2023

Related benchmarks

TaskDatasetResultRank
Global Pose EstimationZInD
Rot Error Mean (deg)2
6
Camera Pose and Layout EstimationZInD 0.6 imgs/rm
Pose Rotation Error Mean0.24
4
Camera Pose and Layout EstimationZInD 1 imgs/rm
Pose Rotation Mean Error0.26
4
Camera Pose and Layout EstimationZInD 2 imgs/rm
Rotation Error Mean (°)0.26
4
Camera Pose and Layout EstimationZInD 3 imgs/rm
Rotation Error Mean (°)0.23
4
Showing 5 of 5 rows

Other info

Follow for update