WildGaussians: 3D Gaussian Splatting in the Wild
About
While the field of 3D scene reconstruction is dominated by NeRFs due to their photorealistic quality, 3D Gaussian Splatting (3DGS) has recently emerged, offering similar quality with real-time rendering speeds. However, both methods primarily excel with well-controlled 3D scenes, while in-the-wild data - characterized by occlusions, dynamic objects, and varying illumination - remains challenging. NeRFs can adapt to such conditions easily through per-image embedding vectors, but 3DGS struggles due to its explicit representation and lack of shared parameters. To address this, we introduce WildGaussians, a novel approach to handle occlusions and appearance changes with 3DGS. By leveraging robust DINO features and integrating an appearance modeling module within 3DGS, our method achieves state-of-the-art results. We demonstrate that WildGaussians matches the real-time rendering speed of 3DGS while surpassing both 3DGS and NeRF baselines in handling in-the-wild data, all within a simple architectural framework.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | D-RE10K-iPhone full-image fidelity (test) | PSNR20.44 | 26 | |
| Novel View Synthesis | D-RE10K static regions only (test) | PSNR18.11 | 26 | |
| Novel View Synthesis | Sacre Coeur Phototourism (test) | PSNR22.71 | 16 | |
| Novel View Synthesis | Trevi Fountain Phototourism (test) | PSNR23.7 | 16 | |
| Novel View Synthesis | Photo Tourism Brandenburg Gate | PSNR27.77 | 12 | |
| Novel View Synthesis | Photo Tourism Trevi Fountain | PSNR23.63 | 12 | |
| Novel View Synthesis | Photo Tourism Sacre Coeur | PSNR22.56 | 12 | |
| Novel View Synthesis | Brandenburg Gate Phototourism (test) | PSNR27.29 | 11 | |
| Illumination Correction | EndoNeRF EC (Pulling) | PSNR15.28 | 10 | |
| Illumination Correction | EndoNeRF EC (Cutting) | PSNR17.71 | 10 |