Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Modeling Depth Ambiguity: A Mixture-Density Representation for Flying-Point-Free Depth Estimation

About

Despite advances in depth estimation, flying points remain a persistent failure mode: near object boundaries, depth estimators often predict spurious 3D points in the empty space between foreground and background surfaces. We trace this artifact to a standard modeling choice: assigning each pixel a single depth hypothesis. At boundaries, a pixel can straddle a foreground and a background surface, so its true depth is ambiguous between the two. A model that predicts a single depth cannot keep both possibilities, so training instead pulls the prediction toward an intermediate depth that lies on neither surface. We address this with MDA, a mixture-density representation that lets the model predict multiple depth hypotheses and their associated probabilities for each pixel. Near boundaries, different hypotheses can align with different surfaces, and the decoded depth is selected from one of these hypotheses rather than placed in the empty space between them. Across different backbones, MDA substantially improves boundary reconstruction and largely removes flying-point artifacts even under severe input blur, while adding negligible runtime overhead. The same mixture-density framework naturally extends to transparent objects, where it predicts multiple depth layers at transparent pixels, and to sky regions, where a dedicated component separates the unbounded sky from finite-depth regions, producing flying-point-free skylines. Project Page: https://biansy000.github.io/mda-site/.

Siyuan Bian, Congrong Xu, Jun Gao• 2026

Related benchmarks

TaskDatasetResultRank
Video Depth EstimationSintel
Delta Threshold Accuracy (1.25)67.4
235
Video Depth EstimationKITTI
Abs Rel0.044
148
Video Depth EstimationBONN
AbsRel4.9
131
3D Reconstruction7 Scenes
Accuracy Median13
128
3D ReconstructionNRGBD
Accuracy Mean16
63
Boundary Quality AnalysisHiRoom Img
Accuracy45
7
Boundary Quality AnalysisHiRoom Seq
Accuracy42
7
Multi-view 3D ReconstructionHiRoom
Accuracy (Mean)24
7
Boundary Quality AnalysisNRGBD Img
Accuracy28
7
Boundary Quality AnalysisNRGBD (Seq)
Accuracy27
7
Showing 10 of 17 rows

Other info

Follow for update