Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second

About

We present MoVieS, a Motion-aware View Synthesis model that reconstructs 4D dynamic scenes from monocular videos in one second. It represents dynamic 3D scenes with pixel-aligned Gaussian primitives and explicitly supervises their time-varying motions. This allows, for the first time, the unified modeling of appearance, geometry and motion from monocular videos, and enables reconstruction, view synthesis and 3D point tracking within a single learning-based framework. By bridging view synthesis with geometry reconstruction, MoVieS enables large-scale training on diverse datasets with minimal dependence on task-specific supervision. As a result, it also naturally supports a wide range of zero-shot applications, such as scene flow estimation and moving object segmentation. Extensive experiments validate the effectiveness and efficiency of MoVieS across multiple tasks, achieving competitive performance while offering several orders of magnitude speedups.

Chenguo Lin, Yuchen Lin, Panwang Pan, Yifan Yu, Tao Hu, Honglei Yan, Katerina Fragkiadaki, Yadong Mu• 2025

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisNVIDIA
PSNR19.16
20
Novel View SynthesisDyCheck (test)
mPSNR18.46
15
Novel View SynthesisADT
PSNR20.35
10
Novel View SynthesisTUM-D
PSNR14.91
10
Novel View SynthesisNVIDIA dataset (test)
Mean PSNR19.16
9
Novel View SynthesisExoRecon (held-out frames)
PSNR (Held-out Frames)18.78
9
3D Point TrackingPanoptic Studio
EPE_3D0.0352
7
View SynthesisN3DV Novel Cameras Moderate
PSNR14.54
6
Novel View SynthesisImmersive Light Field zero-shot
PSNR16.12
6
Novel View SynthesisKubric zero-shot
PSNR14.49
6
Showing 10 of 18 rows

Other info

Follow for update