Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SfM-Free 3D Gaussian Splatting via Hierarchical Training

About

Standard 3D Gaussian Splatting (3DGS) relies on known or pre-computed camera poses and a sparse point cloud, obtained from structure-from-motion (SfM) preprocessing, to initialize and grow 3D Gaussians. We propose a novel SfM-Free 3DGS (SFGS) method for video input, eliminating the need for known camera poses and SfM preprocessing. Our approach introduces a hierarchical training strategy that trains and merges multiple 3D Gaussian representations -- each optimized for specific scene regions -- into a single, unified 3DGS model representing the entire scene. To compensate for large camera motions, we leverage video frame interpolation models. Additionally, we incorporate multi-source supervision to reduce overfitting and enhance representation. Experimental results reveal that our approach significantly surpasses state-of-the-art SfM-free novel view synthesis methods. On the Tanks and Temples dataset, we improve PSNR by an average of 2.25dB, with a maximum gain of 3.72dB in the best scene. On the CO3D-V2 dataset, we achieve an average PSNR boost of 1.74dB, with a top gain of 3.90dB. The code is available at https://github.com/jibo27/3DGS_Hierarchical_Training.

Bo Ji, Angela Yao• 2024

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisTanks&Temples (test)
PSNR35.82
257
Novel View SynthesisMip-NeRF360
PSNR14.79
138
Novel View SynthesisTanks&Temples
PSNR13.83
95
Novel View SynthesisSmallCity
PSNR13.99
19
Novel View SynthesisRigScapes Aerial+Road
PSNR15.33
11
Novel View SynthesisCO3D v2
PSNR28.28
8
Novel View SynthesisOB3D Egocentric
PSNR (Indoor)26.24
6
Novel View SynthesisOB3D NonEgocentric
PSNR (Indoor)22.92
6
Novel View SynthesisRicoh360
PSNR23
6
Camera pose estimationCO3D V2 (110_13051_23361)
RPE_t0.045
5
Showing 10 of 26 rows

Other info

Code

Follow for update