NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed Neural Radiance Fields

About

Visually exploring in a real-world 4D spatiotemporal space freely in VR has been a long-term quest. The task is especially appealing when only a few or even single RGB cameras are used for capturing the dynamic scene. To this end, we present an efficient framework capable of fast reconstruction, compact modeling, and streamable rendering. First, we propose to decompose the 4D spatiotemporal space according to temporal characteristics. Points in the 4D space are associated with probabilities of belonging to three categories: static, deforming, and new areas. Each area is represented and regularized by a separate neural field. Second, we propose a hybrid representations based feature streaming scheme for efficiently modeling the neural fields. Our approach, coined NeRFPlayer, is evaluated on dynamic scenes captured by single hand-held cameras and multi-camera arrays, achieving comparable or superior rendering performance in terms of quality and speed comparable to recent state-of-the-art methods, achieving reconstruction in 10 seconds per frame and interactive rendering.

Liangchen Song, Anpei Chen, Zhong Li, Zhang Chen, Lele Chen, Junsong Yuan, Yi Xu, Andreas Geiger• 2022

Related benchmarks

Task	Dataset	Result
Novel View Synthesis	Neural 3D Video Dataset Standard (All six scenes)	PSNR30.69	47
Dynamic Scene Reconstruction	N3DV (test)	PSNR30.69	45
Dynamic Scene Reconstruction	N3DV	PSNR (dB)30.69	28
Dynamic Scene Reconstruction	N3V	Cook Spinach Score0.113	24
Novel View Synthesis	Neural 3D Video (Neu3DV)	PSNR30.69	21
Novel View Synthesis	DyNeRF (test)	PSNR31.93	18
Dynamic Scene Reconstruction	Neu3D (all scenes)	PSNR30.29	18
Novel View Synthesis	N3V datasets	PSNR30.96	18
Novel View Synthesis	Neu3D (test)	PSNR30.69	18
Dynamic Scene Reconstruction	Neural 3D Video 19 (full)	PSNR30.96	17

Showing 10 of 33 rows

Other info

Follow for update

@wizwand_team Discord