DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
About
Diffusion-based video super-resolution (VSR) has recently achieved remarkable fidelity but still suffers from prohibitive sampling costs. While distribution matching distillation (DMD) can accelerate diffusion models toward one-step generation, directly applying it to VSR often results in training instability alongside degraded and insufficient supervision. To address these issues, we propose DUO-VSR, a three-stage framework built upon a Dual-Stream Distillation strategy that unifies distribution matching and adversarial supervision for one-step VSR. Firstly, a Progressive Guided Distillation Initialization is employed to stabilize subsequent training through trajectory-preserving distillation. Next, the Dual-Stream Distillation jointly optimizes the DMD and Real-Fake Score Feature GAN (RFS-GAN) streams, with the latter providing complementary adversarial supervision leveraging discriminative features from both real and fake score models. Finally, a Preference-Guided Refinement stage further aligns the student with perceptual quality preferences. Extensive experiments demonstrate that DUO-VSR achieves superior visual quality and efficiency over previous one-step VSR approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Super-Resolution | UDM10 | PSNR24.94 | 48 | |
| Video Super-Resolution | SPMCS | PSNR22.9 | 35 | |
| Video Super-Resolution | AIGC60 | NIQE4.42 | 12 | |
| Video Super-Resolution | VideoLQ | NIQE4.08 | 9 | |
| Video Super-Resolution | video 1920x1080 (21-frame sequence) | Step Count1 | 8 | |
| Video Super-Resolution | GSB (test) | Overall Quality0.00e+0 | 7 |