Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniK3D: Universal Camera Monocular 3D Estimation

About

Monocular 3D estimation is crucial for visual perception. However, current methods fall short by relying on oversimplified assumptions, such as pinhole camera models or rectified images. These limitations severely restrict their general applicability, causing poor performance in real-world scenarios with fisheye or panoramic images and resulting in substantial context loss. To address this, we present UniK3D, the first generalizable method for monocular 3D estimation able to model any camera. Our method introduces a spherical 3D representation which allows for better disentanglement of camera and scene geometry and enables accurate metric 3D reconstruction for unconstrained camera models. Our camera component features a novel, model-independent representation of the pencil of rays, achieved through a learned superposition of spherical harmonics. We also introduce an angular loss, which, together with the camera module design, prevents the contraction of the 3D outputs for wide-view cameras. A comprehensive zero-shot evaluation on 13 diverse datasets demonstrates the state-of-the-art performance of UniK3D across 3D, depth, and camera metrics, with substantial gains in challenging large-field-of-view and panoramic settings, while maintaining top accuracy in conventional pinhole small-field-of-view domains. Code and models are available at github.com/lpiccinelli-eth/unik3d .

Luigi Piccinelli, Christos Sakaridis, Mattia Segu, Yung-Hsu Yang, Siyuan Li, Wim Abbeloos, Luc Van Gool• 2025

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationStanford2D3D (test)
δ1 Accuracy96.8
71
Monocular Depth EstimationNYU-Depth v2 (val)
A.Rel4.43
21
Monocular Depth EstimationRTX3090 480x640 efficiency (test)
Latency (ms)3.1
20
Monocular 360 Depth EstimationMatterport3D official (test)
Delta Acc (1.25x)85.8
20
Monocular Depth EstimationKITTI Eigen-split (val)
Delta 1 Accuracy99
16
Depth EstimationTUM-RGBD--
16
360 Depth EstimationStanford2D3D 1.0 (test)
Abs Rel Error0.1795
14
Depth EstimationSintel
delta1 SSI80.5
13
Metric Depth PredictionAggregate (TUM-RGBD, ScanNet, Sintel, Bonn-RGBD) (test)
δ1 Accuracy (SSI)93.6
13
Depth EstimationBonn-RGBD
Delta 1 SSI99
13
Showing 10 of 22 rows

Other info

Code

Follow for update