Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos
About
The success of the Neural Radiance Fields (NeRFs) for modeling and free-view rendering static objects has inspired numerous attempts on dynamic scenes. Current techniques that utilize neural rendering for facilitating free-view videos (FVVs) are restricted to either offline rendering or are capable of processing only brief sequences with minimal motion. In this paper, we present a novel technique, Residual Radiance Field or ReRF, as a highly compact neural representation to achieve real-time FVV rendering on long-duration dynamic scenes. ReRF explicitly models the residual information between adjacent timestamps in the spatial-temporal feature space, with a global coordinate-based tiny MLP as the feature decoder. Specifically, ReRF employs a compact motion grid along with a residual feature grid to exploit inter-frame feature similarities. We show such a strategy can handle large motions without sacrificing quality. We further present a sequential training scheme to maintain the smoothness and the sparsity of the motion/residual grids. Based on ReRF, we design a special FVV codec that achieves three orders of magnitudes compression rate and provides a companion ReRF player to support online streaming of long-duration FVVs of dynamic scenes. Extensive experiments demonstrate the effectiveness of ReRF for compactly representing dynamic radiance fields, enabling an unprecedented free-viewpoint viewing experience in speed and quality.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | Tanks&Temples (test) | PSNR28.3 | 239 | |
| Dynamic Scene Reconstruction | Actors-HQ (Actor 3, Sequence 1) | LPIPS0.292 | 59 | |
| Novel View Synthesis | Synthetic-NeRF (test) | PSNR31.81 | 48 | |
| Dynamic 3D Reconstruction | N3DV | PSNR (dB)29.71 | 16 | |
| Dynamic Scene Reconstruction | Meet Room dataset (test) | PSNR (dB)26.43 | 15 | |
| Novel View Synthesis | ReRF views (6 and 39) (test) | PSNR30.33 | 6 | |
| Dynamic Scene Reconstruction | Dynamic Scene Reconstruction (50 frames) | PSNR37.03 | 6 | |
| Dynamic Scene Reconstruction | Dynamic Scene Reconstruction (200 frames) | PSNR37.02 | 6 | |
| Novel View Synthesis | NHR views (5 and 41) (test) | PSNR30.34 | 6 | |
| Dynamic Scene Reconstruction | ReRF scene Kpop | PSNR31.84 | 5 |