Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fast and Lightweight Novel View Synthesis with Differentiable Multiplane Image

About

Recently, novel view synthesis has witnessed remarkable progress, with mainstream methods such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) delivering impressive results. However, these approaches often struggle to balance rendering speed and model size, and their optimization-based training can be highly time-consuming. Furthermore, they typically rely on dense observations, often failing to produce satisfactory results under sparse-view conditions. Although feed-forward reconstruction significantly reduces the optimization time of 3DGS, its pixel-aligned formulation generates millions of Gaussians from a single image, severely limiting its practical deployment on mobile devices. To address these limitations, we revisit the Multiplane Image(MPI) representation, which represents scenes using a compact set of planar layers for efficient novel view synthesis. Leveraging recent advances in visual foundation models, we utilize predicted point maps for reliable geometric initialization, followed by differentiable optimization. To address the issues of holes and artifacts in sparsely initialized MPI, we introduce one-step diffusion, which participates in both the differentiable optimization of MPI and the postprocessing of rendering results. Compared with a representative GS-based method, our approach is 30.7% faster and uses only 14.8% of its model size, while achieving competitive synthesis quality on front-view scenarios

Kaidi Zhang, Guanxu Zhu• 2026

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisLLFF (test)
PSNR25.469
96
Novel View SynthesisNeRF Synthetic locally forward-facing
PSNR29.161
5
Showing 2 of 2 rows

Other info

Follow for update