NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos
About
In this paper, we aim to model 3D scene dynamics from multi-view videos. Unlike the majority of existing works which usually focus on the common task of novel view synthesis within the training time period, we propose to simultaneously learn the geometry, appearance, and physical velocity of 3D scenes only from video frames, such that multiple desirable applications can be supported, including future frame extrapolation, unsupervised 3D semantic scene decomposition, and dynamic motion transfer. Our method consists of three major components, 1) the keyframe dynamic radiance field, 2) the interframe velocity field, and 3) a joint keyframe and interframe optimization module which is the core of our framework to effectively train both networks. To validate our method, we further introduce two dynamic 3D datasets: 1) Dynamic Object dataset, and 2) Dynamic Indoor Scene dataset. We conduct extensive experiments on multiple datasets, demonstrating the superior performance of our method over all baselines, particularly in the critical tasks of future frame extrapolation and unsupervised 3D semantic scene decomposition.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Future frame extrapolation | Dynamic Indoor Scene Dataset | PSNR29.745 | 24 | |
| Novel view interpolation | Dynamic Indoor Scene Dataset | PSNR30.675 | 22 | |
| Future frame extrapolation | Dynamic Object Dataset | PSNR27.549 | 22 | |
| Novel view interpolation | Dynamic Object Dataset | PSNR29.027 | 20 | |
| Future frame extrapolation | NVIDIA Dynamic Scene Skating | PSNR28.654 | 12 | |
| Future frame extrapolation | NVIDIA Dynamic Scene Truck | PSNR28.269 | 12 | |
| Novel view interpolation | NVIDIA Dynamic Scene Truck | PSNR27.276 | 12 | |
| Novel view interpolation | NVIDIA Dynamic Scene Skating | PSNR26.999 | 12 | |
| Future frame extrapolation | Dynamic Multipart (test) | PSNR25.235 | 9 | |
| Unsupervised Object Segmentation | synthetic indoor scene dataset | AP91.21 | 7 |