Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LaRa: Efficient Large-Baseline Radiance Fields

About

Radiance field methods have achieved photorealistic novel view synthesis and geometry reconstruction. But they are mostly applied in per-scene optimization or small-baseline settings. While several recent works investigate feed-forward reconstruction with large baselines by utilizing transformers, they all operate with a standard global attention mechanism and hence ignore the local nature of 3D reconstruction. We propose a method that unifies local and global reasoning in transformer layers, resulting in improved quality and faster convergence. Our model represents scenes as Gaussian Volumes and combines this with an image encoder and Group Attention Layers for efficient feed-forward reconstruction. Experimental results demonstrate that our model, trained for two days on four GPUs, demonstrates high fidelity in reconstructing 360 deg radiance fields, and robustness to zero-shot and out-of-domain testing. Our project Page: https://apchenstu.github.io/LaRa/.

Anpei Chen, Haofei Xu, Stefano Esposito, Siyu Tang, Andreas Geiger• 2024

Related benchmarks

TaskDatasetResultRank
3D ReconstructionGoogle Scanned Objects (GSO) (test)
LPIPS0.06
17
Image-conditioned 3D GenerationObjaverse (test)
FID43.74
10
3D ReconstructionObjaverse (test)
PSNR27.49
9
Text-conditioned 3D asset generationG-Objaverse (evaluation)
IS12.77
8
Object-level ReconstructionCo3D (test)
PSNR21.18
7
3D ReconstructionObjaverse full observation
PSNR31.91
5
3D ReconstructionObjaverse
PSNR27.79
5
3D ReconstructionGSO full observation
PSNR29.15
5
Image-conditioned 3D Asset GenerationG-Objaverse
PSNR13.32
5
Showing 9 of 9 rows

Other info

Follow for update