SurGe: Improved Surface Geometry in Point Maps
About
Recent feedforward 3D reconstruction methods predict point maps and estimate global 3D geometry remarkably well. However, their predictions still exhibit inaccurate local surface geometry, which is clearly visible qualitatively but only weakly reflected in common metrics. To make these errors more explicit in evaluation, we introduce a point map normal metric that evaluates the local surface orientation induced by neighboring 3D predictions. To reduce these errors, we propose two complementary components: a point gradient matching loss that supervises depth-normalized 3D finite differences, and a Neighborhood Attention Decoder (NAD) that progressively upsamples features and uses Neighborhood Attention for local feature mixing. Across eight zero-shot monocular geometry benchmarks, our model, SurGe, achieves the best average rank for global point map AbsRel and consistently improves local point map and point map normal evaluations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Surface Normal Estimation | iBIMS-1 | MAE16.5 | 34 | |
| Surface Normal Estimation | DIODE | Mean Angle Error12 | 27 | |
| Video Surface Normal Estimation | Sintel | -- | 25 | |
| Depth Estimation | NYU V2 | Global Region AbsRel3.31 | 21 | |
| Point Map Estimation | KITTI | Abs Rel0.048 | 19 | |
| Global Point Map Estimation | ETH3D | AbsRel (Global)3.51 | 12 | |
| Global Point Map Estimation | iBIMS-1 | AbsRel (Global)3.31 | 12 | |
| Global Point Map Estimation | DDAD | Absolute Relative Error (Global)9.05 | 12 | |
| Local Point Map Prediction | ETH3D | AbsRel Loc Error2.66 | 12 | |
| Local Point Map Prediction | iBIMS-1 | AbsRel (Local)3.33 | 12 |