Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend

About

We present AMB3R, a multi-view feed-forward model for dense 3D reconstruction on a metric-scale that addresses diverse 3D vision tasks. The key idea is to leverage a sparse, yet compact, volumetric scene representation as our backend, enabling geometric reasoning with spatial compactness. Although trained solely for multi-view reconstruction, we demonstrate that AMB3R can be seamlessly extended to uncalibrated visual odometry (online) or large-scale structure from motion without the need for task-specific fine-tuning or test-time optimization. Compared to prior pointmap-based models, our approach achieves state-of-the-art performance in camera pose, depth, and metric-scale estimation, 3D reconstruction, and even surpasses optimization-based SLAM and SfM methods with dense reconstruction priors on common benchmarks.

Hengyi Wang, Lourdes Agapito• 2025

Related benchmarks

TaskDatasetResultRank
3D Geometry Estimation and ReconstructionSpatialBench Sparse
AbsRel0.088
42
3D Geometry Estimation and ReconstructionSpatialBench Medium
AbsRel0.085
42
3D Geometry Estimation and ReconstructionSpatialBench Average across settings
Absolute Relative Error21.3
42
3D Geometry Estimation and ReconstructionSpatialBench Single Frame
AbsRel0.466
42
Showing 4 of 4 rows

Other info

Follow for update