GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
About
We present a new approach, termed GPS-Gaussian, for synthesizing novel views of a character in a real-time manner. The proposed method enables 2K-resolution rendering under a sparse-view camera setting. Unlike the original Gaussian Splatting or neural implicit rendering methods that necessitate per-subject optimizations, we introduce Gaussian parameter maps defined on the source views and regress directly Gaussian Splatting properties for instant novel view synthesis without any fine-tuning or optimization. To this end, we train our Gaussian parameter regression module on a large amount of human scan data, jointly with a depth estimation module to lift 2D parameter maps to 3D space. The proposed framework is fully differentiable and experiments on several datasets demonstrate that our method outperforms state-of-the-art methods while achieving an exceeding rendering speed.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | THuman 2.0 (test) | LPIPS0.088 | 39 | |
| 3D human reconstruction | ZJU-MoCap (test) | PSNR29.06 | 31 | |
| Human Novel View Synthesis | ZJU-MoCap | PSNR29.68 | 31 | |
| Novel View Synthesis | DyNeRF (test) | PSNR25.46 | 9 | |
| Novel View Synthesis | Real-world Data (test) | PSNR24.64 | 8 | |
| Novel View Synthesis | THuman 2.0 69 (val) | PSNR25.57 | 5 | |
| Novel View Synthesis | Twindom 55 (val) | PSNR24.79 | 5 | |
| Sparse-view Human Reconstruction | RenderPeople (test) | PSNR25.11 | 5 | |
| Sparse-view Human Reconstruction | Real-world data | PSNR21.55 | 5 | |
| Novel View Synthesis | ENeRF-Outdoor (test) | TCC0.812 | 5 |