VEnhancer: Generative Space-Time Enhancement for Video Generation

About

We present VEnhancer, a generative space-time enhancement framework that improves the existing text-to-video results by adding more details in spatial domain and synthetic detailed motion in temporal domain. Given a generated low-quality video, our approach can increase its spatial and temporal resolution simultaneously with arbitrary up-sampling space and time scales through a unified video diffusion model. Furthermore, VEnhancer effectively removes generated spatial artifacts and temporal flickering of generated videos. To achieve this, basing on a pretrained video diffusion model, we train a video ControlNet and inject it to the diffusion model as a condition on low frame-rate and low-resolution videos. To effectively train this video ControlNet, we design space-time data augmentation as well as video-aware conditioning. Benefiting from the above designs, VEnhancer yields to be stable during training and shares an elegant end-to-end training manner. Extensive experiments show that VEnhancer surpasses existing state-of-the-art video super-resolution and space-time super-resolution methods in enhancing AI-generated videos. Moreover, with VEnhancer, exisiting open-source state-of-the-art text-to-video method, VideoCrafter-2, reaches the top one in video generation benchmark -- VBench.

Jingwen He, Tianfan Xue, Dongyang Liu, Xinqi Lin, Peng Gao, Dahua Lin, Yu Qiao, Wanli Ouyang, Ziwei Liu• 2024

Related benchmarks

Task	Dataset	Result
Video Super-Resolution	REDS4 (test)	PSNR (Avg)23.41	231
Video Super-Resolution	Vid4 (test)	PSNR20.58	206
Video Super-Resolution	UDM10	PSNR23.38	111
Video Super-Resolution	SPMCS	PSNR19.92	68
Video Super-Resolution	UDM10 (test)	PSNR21.64	51
Video Super-Resolution	MVSR4x	PSNR20.5	49
Video Super-Resolution	SPMCS (test)	Avg. PSNR19.272	45
Video Super-Resolution	RealVSR	PSNR15.75	28
Video Face Restoration	VFHQ (test)	PSNR20.47	25
Video Restoration	UDM10 (test)	PSNR25.308	19

Showing 10 of 47 rows

Other info

Follow for update

@wizwand_team Discord