SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

About

Video restoration poses non-trivial challenges in maintaining fidelity while recovering temporally consistent details from unknown degradations in the wild. Despite recent advances in diffusion-based restoration, these methods often face limitations in generation capability and sampling efficiency. In this work, we present SeedVR, a diffusion transformer designed to handle real-world video restoration with arbitrary length and resolution. The core design of SeedVR lies in the shifted window attention that facilitates effective restoration on long video sequences. SeedVR further supports variable-sized windows near the boundary of both spatial and temporal dimensions, overcoming the resolution constraints of traditional window attention. Equipped with contemporary practices, including causal video autoencoder, mixed image and video training, and progressive training, SeedVR achieves highly-competitive performance on both synthetic and real-world benchmarks, as well as AI-generated videos. Extensive experiments demonstrate SeedVR's superiority over existing methods for generic video restoration.

Jianyi Wang, Zhijie Lin, Meng Wei, Yang Zhao, Ceyuan Yang, Fei Xiao, Chen Change Loy, Lu Jiang• 2025

Related benchmarks

Task	Dataset	Result
Video Super-Resolution	UDM10	PSNR24.39	111
Video Super-Resolution	SPMCS	PSNR21.73	68
Video Super-Resolution	UDM10 (test)	PSNR25.76	51
Video Super-Resolution	MVSR4x	PSNR22.16	49
Video Super-Resolution	SPMCS (test)	Avg. PSNR22.37	45
Video Super-Resolution	RealVSR	PSNR20.44	28
Video Restoration	UDM10 (test)	PSNR27.8	19
Video Super-Resolution	VideoLQ	MUSIQ54.41	17
Time Series Reconstruction	TS-S12 (test)	PSNR29.06	13
Video Super-Resolution	video 33-frame 720x1280	Inference Time (s)207.1	13

Showing 10 of 31 rows

Other info

Code

Follow for update

@wizwand_team Discord