Unified Panoramic Geometry Estimation via Multi-View Foundation Models
About
Geometry estimation from perspective images has advanced greatly, maturing to the point where off-the-shelf foundation models can reconstruct 3D scene structure not only from multi-view imagery, but even from a single view. A natural extension is 3D reconstruction from panoramas, with the exciting prospect of recovering a full 360-degree scene from a single panoramic image. In this work, we introduce PaGeR (Panoramic Geometry Reconstruction), a framework to lift powerful 3D foundation models designed for perspective imagery to the panorama domain. Our strategy is to start from a pre-trained transformer for 3D reconstruction and turn it into a unified high-performance model that predicts scale-invariant depth, metric depth, surface normals, and sky masks from both perspective and omnidirectional images, in a single forward pass. By keeping architectural changes to a minimum and mixing perspective and panoramic images during training, PaGeR retains the rich 3D prior of the underlying foundation model while also learning to estimate geometrically consistent 360-degree scenes from single panoramas. We extensively test our method in both indoor and outdoor environments and find that it delivers state-of-the-art accuracy and excellent zero-shot performance across a wide range of scenes. Code, data and models are available at https://github.com/prs-eth/PaGeR.
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Monocular Depth Estimation | Stanford2D3D (test) | δ1 Accuracy | 90.94 | 86 |
| Depth Estimation | Stanford2D3D | AbsRel | 5.93 | 37 |
| Surface Normal Estimation | Structured3D (test) | Mean Angular Error (deg) | 5.49 | 16 |
| Panoramic Depth Estimation | Matterport3D360 | Absolute Relative Error (AbsRel) | 9.67 | 10 |
| Panoramic Depth Estimation | ZüriPano | Absolute Relative Error (AbsRel) | 9.36 | 10 |
| Panoramic Metric Depth Estimation | Matterport3D 360 (test) | AbsRel | 21.83 | 5 |
| Panoramic Metric Depth Estimation | ZüriPano (test) | AbsRel | 31.97 | 5 |
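
For readers unfamiliar with the metrics in the table, the following is a minimal sketch of their standard definitions, not the evaluation code used by PaGeR or by any of the listed benchmarks. The function names are hypothetical, inputs are assumed to be flat sequences of per-pixel values, and pixels with non-positive ground-truth depth are assumed to be invalid and skipped. The functions return fractions (and degrees for the normal error); the depth values in the table appear to be reported as percentages, which is an assumption here.

```python
import math


def abs_rel(pred, gt):
    """Absolute relative error: mean of |pred - gt| / gt over valid pixels."""
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]  # skip invalid ground truth
    return sum(abs(p - g) / g for p, g in pairs) / len(pairs)


def delta1(pred, gt, threshold=1.25):
    """Fraction of valid pixels whose depth ratio max(pred/gt, gt/pred) is below the threshold."""
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    # A non-positive prediction is counted as a miss rather than dropped.
    hits = sum(1 for p, g in pairs if p > 0 and max(p / g, g / p) < threshold)
    return hits / len(pairs)


def mean_angular_error_deg(pred_normals, gt_normals):
    """Mean angle in degrees between predicted and ground-truth normal vectors."""
    total = 0.0
    for a, b in zip(pred_normals, gt_normals):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
        cosine = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
        total += math.degrees(math.acos(cosine))
    return total / len(pred_normals)
```

Lower is better for AbsRel and mean angular error, while higher is better for δ1. Scale-invariant depth is typically aligned to the ground truth (for example by a single scale factor) before these metrics are computed; that alignment step is omitted from this sketch.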