Progressively Deformable 2D Gaussian Splatting for Video Representation at Arbitrary Resolutions

About

Implicit neural representations (INRs) enable fast video compression and effective video processing, but a single model rarely offers scalable decoding across rates and resolutions. In practice, multi-resolution typically relies on retraining or multi-branch designs, and structured pruning failed to provide a permutation-invariant progressive transmission order. Motivated by the explicit structure and efficiency of Gaussian splatting, we propose D2GV-AR, a deformable 2D Gaussian video representation that enables \emph{arbitrary-scale} rendering and \emph{any-ratio} progressive coding within a single model. We partition each video into fixed-length Groups of Pictures and represent each group with a canonical set of 2D Gaussian primitives, whose temporal evolution is modeled by a neural ordinary differential equation. During training and rendering, we apply scale-aware grouping according to Nyquist sampling theorem to form a nested hierarchy across resolutions. Once trained, primitives can be pruned via a D-optimal subset objective to enable any-ratio progressive coding. Extensive experiments show that D2GV-AR renders at over 250 FPS while matching or surpassing recent INR baselines, enabling multiscale continuous rate--quality adaptation.

Mufan Liu, Qi Yang, Miaoran Zhao, He Huang, Le Yang, Zhu Li, Yiling Xu• 2025

Related benchmarks

Task	Dataset	Result
Video Representation	Bunny dataset	PSNR37.26	26
Video Representation	DAVIS	PSNR (Average)31.09	11
Video Denoising	HoneyBee video sequence	White PSNR40.14	8
Video Representation	UVG (full)	PSNR34.31	6
Video Inpainting	Bee, Swan, Cows, Bos., Beauty sequences	PSNR (Bee)40.32	3

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord