Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting

About

We present WorldMirror, an all-in-one, feed-forward model for versatile 3D geometric prediction tasks. Unlike existing methods constrained to image-only inputs or customized for a specific task, our framework flexibly integrates diverse geometric priors, including camera poses, intrinsics, and depth maps, while simultaneously generating multiple 3D representations: dense point clouds, multi-view depth maps, camera parameters, surface normals, and 3D Gaussians. This elegant and unified architecture leverages available prior information to resolve structural ambiguities and delivers geometrically consistent 3D outputs in a single forward pass. WorldMirror achieves state-of-the-art performance across diverse benchmarks from camera, point map, depth, and surface normal estimation to novel view synthesis, while maintaining the efficiency of feed-forward inference. Code and models will be publicly available soon.

Yifan Liu, Zhiyuan Min, Zhenwei Wang, Junta Wu, Tengfei Wang, Yixuan Yuan, Yawei Luo, Chunchao Guo• 2025

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisRealEstate10K
PSNR25.54
173
Camera pose estimationTUM-dynamic
ATE0.01
163
Novel View SynthesisRE10K
SSIM70.7
142
Monocular Depth EstimationNYU V2
Delta 1 Acc96.8
131
Novel View SynthesisDL3DV
PSNR21.76
84
Monocular Depth EstimationScanNet
AbsRel5.2
81
Novel View SynthesisRe10K (test)
PSNR22.11
79
Novel View SynthesisRE10K challenging views (test)
PSNR20.55
56
Point Map Estimation7 Scenes--
47
Novel View SynthesisACID 20 (test)
PSNR22.2
24
Showing 10 of 31 rows

Other info

Follow for update