Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details

About

We propose MoGe-2, an advanced open-domain geometry estimation model that recovers a metric scale 3D point map of a scene from a single image. Our method builds upon the recent monocular geometry estimation approach, MoGe, which predicts affine-invariant point maps with unknown scales. We explore effective strategies to extend MoGe for metric geometry prediction without compromising the relative geometry accuracy provided by the affine-invariant point representation. Additionally, we discover that noise and errors in real data diminish fine-grained detail in the predicted geometry. We address this by developing a unified data refinement approach that filters and completes real data from different sources using sharp synthetic labels, significantly enhancing the granularity of the reconstructed geometry while maintaining the overall accuracy. We train our model on a large corpus of mixed datasets and conducted comprehensive evaluations, demonstrating its superior performance in achieving accurate relative geometry, precise metric scale, and fine-grained detail recovery -- capabilities that no previous methods have simultaneously achieved.

Ruicheng Wang, Sicheng Xu, Yue Dong, Yu Deng, Jianfeng Xiang, Zelong Lv, Guangzhong Sun, Xin Tong, Jiaolong Yang• 2025

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationKITTI
Abs Rel0.049
161
Monocular Depth EstimationETH3D
AbsRel3.2
117
Monocular Depth EstimationNYU V2
Delta 1 Acc98.4
113
Depth EstimationScanNet
AbsRel3.3
94
Monocular Depth EstimationDIODE
AbsRel4.8
93
Monocular Depth EstimationScanNet
AbsRel3.8
64
Depth EstimationDIODE
Delta-1 Accuracy97.4
62
Monocular Depth EstimationiBIMS-1
ARel0.0992
32
Depth EstimationHAMMER
Delta 168.5
29
Monocular Depth EstimationSintel
Abs Rel0.277
21
Showing 10 of 73 rows
...

Other info

Follow for update