Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

About

In this paper, we propose NeoVerse, a versatile 4D world model that is capable of 4D reconstruction, novel-trajectory video generation, and rich downstream applications. We first identify a common limitation of scalability in current 4D world modeling methods, caused either by expensive and specialized multi-view 4D data or by cumbersome training pre-processing. In contrast, our NeoVerse is built upon a core philosophy that makes the full pipeline scalable to diverse in-the-wild monocular videos. Specifically, NeoVerse features pose-free feed-forward 4D reconstruction, online monocular degradation pattern simulation, and other well-aligned techniques. These designs empower NeoVerse with versatility and generalization to various domains. Meanwhile, NeoVerse achieves state-of-the-art performance in standard reconstruction and generation benchmarks. Our project page is available at https://neoverse-4d.github.io

Yuxue Yang, Lue Fan, Ziqi Shi, Junran Peng, Feng Wang, Zhaoxiang Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Novel View GenerationVBench 100 unseen in-the-wild videos 30
Inference Time (Generation)18
6
Static ReconstructionVRNeRF 16 views (test)
PSNR20.73
4
Static ReconstructionScannet++ 32 views (test)
PSNR25.34
4
Dynamic ReconstructionADT
PSNR32.56
3
Dynamic ReconstructionDyCheck
PSNR11.56
3
Showing 5 of 5 rows

Other info

GitHub

Follow for update