Progressively Deformable 2D Gaussian Splatting for Video Representation at Arbitrary Resolutions
About
Implicit neural representations (INRs) enable fast video compression and effective video processing, but a single model rarely offers scalable decoding across rates and resolutions. In practice, multi-resolution typically relies on retraining or multi-branch designs, and structured pruning failed to provide a permutation-invariant progressive transmission order. Motivated by the explicit structure and efficiency of Gaussian splatting, we propose D2GV-AR, a deformable 2D Gaussian video representation that enables \emph{arbitrary-scale} rendering and \emph{any-ratio} progressive coding within a single model. We partition each video into fixed-length Groups of Pictures and represent each group with a canonical set of 2D Gaussian primitives, whose temporal evolution is modeled by a neural ordinary differential equation. During training and rendering, we apply scale-aware grouping according to Nyquist sampling theorem to form a nested hierarchy across resolutions. Once trained, primitives can be pruned via a D-optimal subset objective to enable any-ratio progressive coding. Extensive experiments show that D2GV-AR renders at over 250 FPS while matching or surpassing recent INR baselines, enabling multiscale continuous rate--quality adaptation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Representation | Bunny dataset | PSNR37.26 | 26 | |
| Video Representation | DAVIS | PSNR (Average)31.09 | 11 | |
| Video Denoising | HoneyBee video sequence | White PSNR40.14 | 8 | |
| Video Representation | UVG (full) | PSNR34.31 | 6 | |
| Video Inpainting | Bee, Swan, Cows, Bos., Beauty sequences | PSNR (Bee)40.32 | 3 |