SfM-Free 3D Gaussian Splatting via Hierarchical Training
About
Standard 3D Gaussian Splatting (3DGS) relies on known or pre-computed camera poses and a sparse point cloud, obtained from structure-from-motion (SfM) preprocessing, to initialize and grow 3D Gaussians. We propose a novel SfM-Free 3DGS (SFGS) method for video input, eliminating the need for known camera poses and SfM preprocessing. Our approach introduces a hierarchical training strategy that trains and merges multiple 3D Gaussian representations -- each optimized for specific scene regions -- into a single, unified 3DGS model representing the entire scene. To compensate for large camera motions, we leverage video frame interpolation models. Additionally, we incorporate multi-source supervision to reduce overfitting and enhance representation. Experimental results reveal that our approach significantly surpasses state-of-the-art SfM-free novel view synthesis methods. On the Tanks and Temples dataset, we improve PSNR by an average of 2.25dB, with a maximum gain of 3.72dB in the best scene. On the CO3D-V2 dataset, we achieve an average PSNR boost of 1.74dB, with a top gain of 3.90dB. The code is available at https://github.com/jibo27/3DGS_Hierarchical_Training.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | Tanks&Temples (test) | PSNR35.82 | 239 | |
| Novel View Synthesis | RigScapes Aerial+Road | PSNR15.33 | 11 | |
| Novel View Synthesis | SmallCity | PSNR13.99 | 11 | |
| Camera pose estimation | CO3D V2 (110_13051_23361) | RPE_t0.045 | 5 | |
| Camera pose estimation | CO3D V2 (415_57112_110099) | RPE Translation0.049 | 5 | |
| Camera pose estimation | CO3D 106_12648_23157 V2 | RPE (Translation)0.064 | 5 | |
| Camera pose estimation | CO3D V2 (245_26182_52130) | RPE (Translation)0.093 | 5 | |
| Camera pose estimation | CO3D V2 (34_1403_4393) | RPE (t)0.041 | 5 | |
| Novel View Synthesis | CO3D V2 (Sequence 110_13051_23361) | PSNR29.95 | 5 | |
| Novel View Synthesis | CO3D v2 (Sequence 415_57112_110099) | PSNR27.23 | 5 |