Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views

About

We introduce SPFSplat, an efficient framework for 3D Gaussian splatting from sparse multi-view images, requiring no ground-truth poses during training or inference. It employs a shared feature extraction backbone, enabling simultaneous prediction of 3D Gaussian primitives and camera poses in a canonical space from unposed inputs within a single feed-forward step. Alongside the rendering loss based on estimated novel-view poses, a reprojection loss is integrated to enforce the learning of pixel-aligned Gaussian primitives for enhanced geometric constraints. This pose-free training paradigm and efficient one-step feed-forward design make SPFSplat well-suited for practical applications. Remarkably, despite the absence of pose supervision, SPFSplat achieves state-of-the-art performance in novel view synthesis even under significant viewpoint changes and limited image overlap. It also surpasses recent methods trained with geometry priors in relative pose estimation. Code and trained models are available on our project page: https://ranrhuang.github.io/spfsplat/.

Ranran Huang, Krystian Mikolajczyk• 2025

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisRE10K
SSIM88.8
161
Novel View SynthesisScanNet
PSNR25.85
130
Novel View SynthesisDTU
PSNR14.042
115
Novel View SynthesisDL3DV
PSNR18.091
84
Novel View SynthesisRe10K (test)
PSNR24.97
79
Novel View SynthesisScanNet++
PSNR14
74
Novel View SynthesisACID
PSNR25.965
71
Novel View SynthesisACID (test)
PSNR25.07
47
Novel View SynthesisMip-NeRF 360
PSNR14.65
44
Novel View SynthesisRE10K (Medium)
PSNR25.334
41
Showing 10 of 53 rows

Other info

Follow for update