3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos
About
Constructing photo-realistic Free-Viewpoint Videos (FVVs) of dynamic scenes from multi-view videos remains a challenging endeavor. Despite the remarkable advancements achieved by current neural rendering techniques, these methods generally require complete video sequences for offline training and are not capable of real-time rendering. To address these constraints, we introduce 3DGStream, a method designed for efficient FVV streaming of real-world dynamic scenes. Our method achieves fast on-the-fly per-frame reconstruction within 12 seconds and real-time rendering at 200 FPS. Specifically, we utilize 3D Gaussians (3DGs) to represent the scene. Instead of the naïve approach of directly optimizing 3DGs per-frame, we employ a compact Neural Transformation Cache (NTC) to model the translations and rotations of 3DGs, markedly reducing the training time and storage required for each FVV frame. Furthermore, we propose an adaptive 3DG addition strategy to handle emerging objects in dynamic scenes. Experiments demonstrate that 3DGStream achieves competitive performance in terms of rendering speed, image quality, training time, and model storage when compared with state-of-the-art methods.
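To make the idea of per-Gaussian translations and rotations concrete, the following is a minimal sketch of how a predicted offset and rotation might be applied to one 3D Gaussian. It is not the authors' implementation: the function names, the `(w, x, y, z)` quaternion layout, and the left-multiplication order are assumptions for illustration, and the NTC that would produce `d_mean` and `d_rotation` is not modeled here.

```python
import math

def quat_multiply(a, b):
    """Hamilton product of two quaternions in (w, x, y, z) order."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (
        aw * bw - ax * bx - ay * by - az * bz,
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
    )

def normalize(q):
    """Scale a quaternion to unit length so it represents a pure rotation."""
    norm = math.sqrt(sum(c * c for c in q))
    return tuple(c / norm for c in q)

def transform_gaussian(mean, rotation, d_mean, d_rotation):
    """Apply a translation and a rotation to one Gaussian (hypothetical helper).

    mean:       (x, y, z) center of the Gaussian in the previous frame
    rotation:   unit quaternion giving the Gaussian's orientation
    d_mean:     predicted translation (assumed to come from a transformation cache)
    d_rotation: predicted rotation as a quaternion, not necessarily normalized
    """
    new_mean = tuple(m + d for m, d in zip(mean, d_mean))
    # Assumption: the predicted rotation is composed on the left.
    new_rotation = normalize(quat_multiply(normalize(d_rotation), rotation))
    return new_mean, new_rotation

# Example: shift a Gaussian along x and rotate it 90 degrees about the z axis.
half_angle = math.pi / 4
new_mean, new_rotation = transform_gaussian(
    mean=(1.0, 0.0, 0.0),
    rotation=(1.0, 0.0, 0.0, 0.0),  # identity orientation
    d_mean=(0.1, 0.0, 0.0),
    d_rotation=(math.cos(half_angle), 0.0, 0.0, math.sin(half_angle)),
)
```

Under this sketch, only the per-Gaussian offsets change between frames while the other Gaussian attributes carry over from the previous frame, which is the intuition behind the reduced per-frame training time and storage described above.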
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Novel View Synthesis | Neural 3D Video Dataset Standard (All six scenes) | PSNR | 31.67 | 36 |
| Dynamic Scene Reconstruction | N3DV (test) | PSNR | 31.69 | 32 |
| Novel View Synthesis | Neural 3D Video Dataset (Flame Salmon scene) | PSNR | 19.634 | 19 |
| Dynamic 3D Reconstruction | N3DV | PSNR (dB) | 31.54 | 16 |
| Dynamic Scene Reconstruction | Meet Room dataset (test) | PSNR (dB) | 30.79 | 15 |
| Multi-view Dynamic Reconstruction | Neural 3D Video Dataset (N3DV) | PSNR | 31.67 | 14 |
| Dynamic Scene Reconstruction | Meet Room (test) | PSNR | 30.79 | 14 |
| Dynamic Scene Reconstruction | N3V | Coffee Martini Score | 27.75 | 14 |
| Dynamic Scene Reconstruction | N3DV | PSNR (dB) | 31.67 | 11 |
| Novel View Synthesis | PackUV-2B (test) | PSNR | 23.17 | 9 |