HexPlane: A Fast Representation for Dynamic Scenes
About
Modeling and re-rendering dynamic 3D scenes is a challenging task in 3D vision. Prior approaches build on NeRF and rely on implicit representations. This is slow since it requires many MLP evaluations, constraining real-world applications. We show that dynamic 3D scenes can be explicitly represented by six planes of learned features, leading to an elegant solution we call HexPlane. A HexPlane computes features for points in spacetime by fusing vectors extracted from each plane, which is highly efficient. Pairing a HexPlane with a tiny MLP to regress output colors and training via volume rendering gives impressive results for novel view synthesis on dynamic scenes, matching the image quality of prior work but reducing training time by more than $100\times$. Extensive ablations confirm our HexPlane design and show that it is robust to different feature fusion mechanisms, coordinate systems, and decoding mechanisms. HexPlane is a simple and effective solution for representing 4D volumes, and we hope they can broadly contribute to modeling spacetime for dynamic 3D scenes.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | D-NeRF synthetic (test) | Average PSNR31.04 | 42 | |
| Novel View Synthesis | Blender (test) | PSNR31.04 | 37 | |
| Novel View Synthesis | Neural 3D Video Dataset Standard (All six scenes) | PSNR31.71 | 36 | |
| Dynamic Scene Reconstruction | N3DV (test) | PSNR31.7 | 32 | |
| Dynamic Scene Reconstruction | N3DV coffee martini (test) | PSNR31.7 | 18 | |
| Novel View Synthesis | Neu3D (test) | PSNR31.7 | 18 | |
| Dynamic Scene Reconstruction | Neural 3D Video 19 (full) | PSNR31.71 | 17 | |
| Dynamic View Synthesis | Neural 3D Video 19 (test) | PSNR31.71 | 16 | |
| 3D Video Synthesis | Neural 3D Video Dataset (Cut Roasted Beef scene) | PSNR30.83 | 12 | |
| Novel View Synthesis | Plenoptic Video all scenes average | PSNR31.705 | 11 |