Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

G3R: Gradient Guided Generalizable Reconstruction

About

Large scale 3D scene reconstruction is important for applications such as virtual reality and simulation. Existing neural rendering approaches (e.g., NeRF, 3DGS) have achieved realistic reconstructions on large scenes, but optimize per scene, which is expensive and slow, and exhibit noticeable artifacts under large view changes due to overfitting. Generalizable approaches or large reconstruction models are fast, but primarily work for small scenes/objects and often produce lower quality rendering results. In this work, we introduce G3R, a generalizable reconstruction approach that can efficiently predict high-quality 3D scene representations for large scenes. We propose to learn a reconstruction network that takes the gradient feedback signals from differentiable rendering to iteratively update a 3D scene representation, combining the benefits of high photorealism from per-scene optimization with data-driven priors from fast feed-forward prediction methods. Experiments on urban-driving and drone datasets show that G3R generalizes across diverse large scenes and accelerates the reconstruction process by at least 10x while achieving comparable or better realism compared to 3DGS, and also being more robust to large view changes.

Yun Chen, Jingkang Wang, Ze Yang, Sivabalan Manivasagam, Raquel Urtasun• 2024

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisDL3DV Zero-shot 8 views, 512 x 960
PSNR27.04
10
3D ReconstructionDL3DV Zero-shot 32 views, 256 x 448
PSNR29.56
9
Urban Scene ReconstructionPandaSet Interpolation (test)
PSNR23.28
8
Urban Scene ReconstructionPandaSet Extrapolation Hard (test)
FID@4m174.6
8
Urban Scene ReconstructionPandaSet Extrapolation Moderate (test)
FID@1m89.75
8
Novel View SynthesisRealEstate10K 8 views, 512 x 960 (test)
PSNR28.01
6
Sparse-view SynthesisPandaSet Sparse View Split (10% frames for train)
PSNR18.37
5
360° View SynthesisPandaSet 360° View (Rotating actors 0° to 360°)
FID191.9
4
Novel Camera SynthesisPandaSet (Novel Camera Split)
PSNR17.4
4
Showing 9 of 9 rows

Other info

Follow for update