Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

About

Multi-view 3D reconstruction has achieved remarkable progress with the advent of feed-forward 3D reconstruction models. However, these models are typically trained and evaluated under ideal, degradation-free imaging conditions, whereas real-world observations often contain degradations that differ significantly from such settings. Improving robustness for multi-view 3D reconstruction under degraded conditions therefore remains an important challenge. We present Geometry-Aware Representation Denoising (GARD), a novel framework that performs diffusion-based multi-view restoration directly in the feature space of a feed-forward 3D reconstruction model. This design exploits the geometry-aware feature representations of the 3D reconstructor to effectively recover accurate scene geometry. Furthermore, by employing an additional RGB image decoder, the refined representations can also be used to restore high-quality RGB images, thereby enabling the simultaneous recovery of 3D scene geometry and high-quality imagery. Comprehensive experiments on the Depth Anything 3 (DA3) benchmark demonstrate the effectiveness of the proposed GARD framework.

Jin Hyeon Kim, Jaeeun Lee, Claire Kim, Kyoungjin Oh, Paul Hyunbin Cho, Jaewon Min, Yeji Choi, Jihye Park, Hyunhee Park, Minkyu Park, Seungryong Kim• 2026

Related benchmarks

TaskDatasetResultRank
3D Reconstruction7 Scenes--
128
3D ReconstructionDTU--
55
3D ReconstructionETH3D
F1 Score20.38
35
Camera pose estimation7Scenes DA3 benchmark
AUC535.55
20
Camera pose estimationScanNet++ DA3
AUC556.44
20
Camera pose estimationHiRoom DA3 benchmark
AUC512
20
Depth Estimation7Scenes DA3 benchmark BASE (test)
AbsRel0.071
20
3D ReconstructionHiRoom
F1 Score12.69
18
Pose EstimationETH3D DA3 (test)
AUC@3074.68
12
3D ReconstructionHiRoom DA3 (test)
Overall Score0.293
10
Showing 10 of 40 rows

Other info

GitHub

Follow for update