Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts

About

Cross-scene generalizable NeRF models, which can directly synthesize novel views of unseen scenes, have become a new spotlight of the NeRF field. Several existing attempts rely on increasingly end-to-end "neuralized" architectures, i.e., replacing scene representation and/or rendering modules with performant neural networks such as transformers, and turning novel view synthesis into a feed-forward inference pipeline. Because those feed-forward "neuralized" architectures still do not fit diverse scenes well out of the box, we propose to bridge them with the powerful Mixture-of-Experts (MoE) idea from large language models (LLMs), which has demonstrated superior generalization ability by balancing larger overall model capacity against flexible per-instance specialization. Starting from a recent generalizable NeRF architecture called GNT, we first demonstrate that MoE can be neatly plugged in to enhance the model. We further customize a shared permanent expert and a geometry-aware consistency loss to enforce cross-scene consistency and spatial smoothness, respectively, both of which are essential for generalizable view synthesis. Our proposed model, dubbed GNT with Mixture-of-View-Experts (GNT-MOVE), experimentally achieves state-of-the-art results when transferring to unseen scenes, indicating remarkably better cross-scene generalization in both zero-shot and few-shot settings. Our code is available at https://github.com/VITA-Group/GNT-MOVE.
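
To make the "MoE plus a shared permanent expert" idea in the abstract concrete, here is a minimal NumPy sketch of a top-k routed expert layer in which one extra expert is applied to every token regardless of routing. This is an illustration under stated assumptions, not the GNT-MOVE implementation: the class name `ToyMoELayer`, the use of plain linear experts, the softmax-over-top-k gating, and all sizes are hypothetical choices made for brevity. The geometry-aware consistency loss is not covered here.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


class ToyMoELayer:
    """Hypothetical top-k MoE layer with one always-on shared expert.

    Each expert is a single linear map; real MoE layers typically use MLPs.
    """

    def __init__(self, dim, num_experts=4, top_k=2, seed=0):
        rng = np.random.default_rng(seed)
        self.top_k = top_k
        # Router scores every token against every routed expert.
        self.router = rng.normal(scale=0.1, size=(dim, num_experts))
        # Routed experts: chosen per token.
        self.experts = rng.normal(scale=0.1, size=(num_experts, dim, dim))
        # Shared "permanent" expert: applied to every token (assumed design).
        self.shared = rng.normal(scale=0.1, size=(dim, dim))

    def __call__(self, tokens):
        # tokens: (n, dim)
        logits = tokens @ self.router                           # (n, E)
        top_idx = np.argsort(-logits, axis=-1)[:, : self.top_k]  # (n, k)
        top_logits = np.take_along_axis(logits, top_idx, axis=-1)
        gates = softmax(top_logits, axis=-1)                    # (n, k)

        # The shared expert contributes to every token's output.
        out = tokens @ self.shared
        # Add the gated outputs of each token's selected experts.
        for slot in range(self.top_k):
            weights = self.experts[top_idx[:, slot]]            # (n, dim, dim)
            expert_out = np.einsum("nd,nde->ne", tokens, weights)
            out += gates[:, slot : slot + 1] * expert_out
        return out, top_idx


layer = ToyMoELayer(dim=8)
x = np.random.default_rng(1).normal(size=(5, 8))
y, chosen = layer(x)
print(y.shape, chosen.shape)
```

Because the shared expert bypasses the router, it sees every token from every scene, which is one plausible way to read the abstract's claim that it enforces cross-scene consistency while the routed experts specialize.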

Wenyan Cong, Hanxue Liang, Peihao Wang, Zhiwen Fan, Tianlong Chen, Mukund Varma, Yi Wang, Zhangyang Wang • 2023

Related benchmarks

Task                  Dataset                 Metric  Result  Rank
Novel View Synthesis  LLFF                    PSNR    26.02   124
Novel View Synthesis  NeRF Synthetic          PSNR    27.47   92
Novel View Synthesis  Tanks&Temples           SSIM    64      39
Novel View Synthesis  LLFF 3-shot             PSNR    19.58   17
View Synthesis        Tanks&Temples           PSNR    20.1    15
View Synthesis        Shiny-6 (test)          PSNR    27.54   11
Novel View Synthesis  NMR                     PSNR    32.12   5
Novel View Synthesis  LLFF 6-shot             PSNR    22.36   5
Novel View Synthesis  NeRF Synthetic 6-shot   PSNR    22.39   5
Novel View Synthesis  NeRF Synthetic 12-shot  PSNR    25.25   5
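
Most of the benchmark results listed here are reported as PSNR (peak signal-to-noise ratio, in dB), where higher is better. As a reminder of what that number measures, the sketch below computes PSNR between a rendered image and a reference image using the standard definition, 10 · log10(MAX² / MSE). It assumes pixel values scaled to [0, 1]; the function name `psnr` and the `max_value` parameter are illustrative, and each benchmark's exact evaluation protocol (resolution, cropping, averaging across views) is not specified here.

```python
import numpy as np


def psnr(rendered, reference, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    rendered = np.asarray(rendered, dtype=np.float64)
    reference = np.asarray(reference, dtype=np.float64)
    mse = np.mean((rendered - reference) ** 2)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return float("inf")
    return 10.0 * np.log10(max_value ** 2 / mse)


# A uniform error of 0.1 on a [0, 1] image gives MSE = 0.01.
reference = np.zeros((4, 4, 3))
rendered = np.full((4, 4, 3), 0.1)
print(round(psnr(rendered, reference), 2))  # 20.0
```

Since the scale is logarithmic, a gain of about 3 dB corresponds to roughly halving the mean squared error.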
