SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

About

Recent advances in diffusion-based video restoration (VR) demonstrate significant improvement in visual quality, yet incur a prohibitive computational cost during inference. While several distillation-based approaches have shown the potential of one-step image restoration, extending them to VR remains challenging and underexplored, particularly for high-resolution video in real-world settings. In this work, we propose a one-step diffusion-based VR model, termed SeedVR2, which performs adversarial VR training against real data. To handle challenging high-resolution VR within a single step, we introduce several enhancements to both the model architecture and the training procedure. Specifically, we propose an adaptive window attention mechanism in which the window size is dynamically adjusted to fit the output resolution, avoiding the window inconsistency observed when window attention with a predefined window size is applied to high-resolution VR. To stabilize and improve adversarial post-training for VR, we further verify the effectiveness of a series of losses, including a proposed feature matching loss, without significantly sacrificing training efficiency. Extensive experiments show that SeedVR2 achieves performance comparable to or even better than existing VR approaches in a single step.
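
The adaptive-window idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact rule, so everything here is an assumption for illustration: the function names, the `windows_per_axis` parameter, and the choice of ceiling division to derive a window size from the feature-map resolution are hypothetical and are not taken from SeedVR2.

```python
import math


def adaptive_window_size(height, width, windows_per_axis=4):
    """Derive a window size from the feature-map resolution.

    Hypothetical rule (an assumption, not the paper's formula): choose the
    smallest window such that at most `windows_per_axis` windows are needed
    to cover each axis.
    """
    if height <= 0 or width <= 0 or windows_per_axis <= 0:
        raise ValueError("height, width and windows_per_axis must be positive")
    win_h = math.ceil(height / windows_per_axis)
    win_w = math.ceil(width / windows_per_axis)
    return win_h, win_w


def partition_counts(height, width, window):
    """Count how many windows of size `window` are needed along each axis."""
    win_h, win_w = window
    return math.ceil(height / win_h), math.ceil(width / win_w)


if __name__ == "__main__":
    for h, w in [(64, 96), (90, 160)]:
        window = adaptive_window_size(h, w)
        print((h, w), "window:", window, "windows per axis:", partition_counts(h, w, window))
```

Under this assumed rule, the window grows with the resolution instead of staying fixed, so the number of windows per axis stays bounded as the output gets larger. For example, `adaptive_window_size(90, 160)` returns `(23, 40)`.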

Jianyi Wang, Shanchuan Lin, Zhijie Lin, Yuxi Ren, Meng Wei, Zongsheng Yue, Shangchen Zhou, Hao Chen, Yang Zhao, Ceyuan Yang, Xuefeng Xiao, Chen Change Loy, Lu Jiang • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Video Super-Resolution | SPMCS (test) | Avg. PSNR | 19.147 | 36 |
| Video Restoration | REDS30 | PSNR | 25.43 | 17 |
| Video Restoration | REDS30 (test) | PSNR | 26.38 | 10 |
| Video Restoration | YouHQ40 Spatio-Temporal Downsampling | PSNR | 24.77 | 10 |
| Video Restoration | YouHQ40 Spatio-Temporal Light | PSNR | 21.47 | 10 |
| Video Restoration | UDM10 (test) | PSNR | 28.634 | 10 |
| Video Restoration | REDS Spatio-Temporal Light 30 | PSNR | 20.59 | 10 |
| Video Restoration | REDS30 Spatio-Temporal Strong | PSNR | 20.41 | 10 |
| Video Restoration | YouHQ40 Spatial Downsampling | PSNR | 24.2 | 10 |
| Video Restoration | REDS30 Spatial Downsampling | PSNR | 22.59 | 10 |
Showing 10 of 15 rows
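
For readers unfamiliar with the metric in the table, PSNR (in dB) is 10·log10(MAX² / MSE), where MSE is the mean squared error between the reference and the restored signal. The sketch below is a plain-Python illustration of that standard definition. The function name `psnr`, the flat-list input, and the 8-bit peak of 255 are assumptions for illustration; the page does not state which variant (for example RGB versus luma only, or how frames are averaged) the listed benchmarks use.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Assumes flat sequences of pixel intensities and an 8-bit peak value by
    default; both are illustrative choices, not the benchmarks' protocol.
    """
    if len(reference) != len(restored) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - x) ** 2 for r, x in zip(reference, restored)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    print(psnr([10, 20, 30], [11, 21, 31]))
```

Higher values mean the restored frames are closer to the reference in a pixel-wise sense.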
