VEnhancer: Generative Space-Time Enhancement for Video Generation
About
We present VEnhancer, a generative space-time enhancement framework that improves the existing text-to-video results by adding more details in spatial domain and synthetic detailed motion in temporal domain. Given a generated low-quality video, our approach can increase its spatial and temporal resolution simultaneously with arbitrary up-sampling space and time scales through a unified video diffusion model. Furthermore, VEnhancer effectively removes generated spatial artifacts and temporal flickering of generated videos. To achieve this, basing on a pretrained video diffusion model, we train a video ControlNet and inject it to the diffusion model as a condition on low frame-rate and low-resolution videos. To effectively train this video ControlNet, we design space-time data augmentation as well as video-aware conditioning. Benefiting from the above designs, VEnhancer yields to be stable during training and shares an elegant end-to-end training manner. Extensive experiments show that VEnhancer surpasses existing state-of-the-art video super-resolution and space-time super-resolution methods in enhancing AI-generated videos. Moreover, with VEnhancer, exisiting open-source state-of-the-art text-to-video method, VideoCrafter-2, reaches the top one in video generation benchmark -- VBench.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Super-Resolution | UDM10 (test) | PSNR21.64 | 51 | |
| Video Super-Resolution | UDM10 | PSNR23.38 | 48 | |
| Video Super-Resolution | SPMCS (test) | Avg. PSNR19.272 | 45 | |
| Video Super-Resolution | SPMCS | PSNR19.92 | 35 | |
| Video Super-Resolution | MVSR4x | PSNR20.5 | 22 | |
| Video Super-Resolution | YouHQ40 | PSNR19.78 | 18 | |
| Video Super-Resolution | RealVSR | PSNR15.75 | 18 | |
| Video Restoration | REDS30 | PSNR22.4 | 17 | |
| Video Super-Resolution | AIGC60 | NIQE5.86 | 12 | |
| Video Super-Resolution | VideoGen30 (test) | Visual Quality2.686 | 10 |