Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Solving Video Inverse Problems Using Image Diffusion Models

About

Recently, diffusion model-based inverse problem solvers (DIS) have emerged as state-of-the-art approaches for addressing inverse problems, including image super-resolution, deblurring, inpainting, etc. However, their application to video inverse problems arising from spatio-temporal degradation remains largely unexplored due to the challenges in training video diffusion models. To address this issue, here we introduce an innovative video inverse solver that leverages only image diffusion models. Specifically, by drawing inspiration from the success of the recent decomposed diffusion sampler (DDS), our method treats the time dimension of a video as the batch dimension of image diffusion models and solves spatio-temporal optimization problems within denoised spatio-temporal batches derived from each image diffusion model. Moreover, we introduce a batch-consistent diffusion sampling strategy that encourages consistency across batches by synchronizing the stochastic noise components in image diffusion models. Our approach synergistically combines batch-consistent sampling with simultaneous optimization of denoised spatio-temporal batches at each reverse diffusion step, resulting in a novel and efficient diffusion sampling strategy for video inverse problems. Experimental results demonstrate that our method effectively addresses various spatio-temporal degradations in video inverse problems, achieving state-of-the-art reconstructions. Project page: https://svi-diffusion.github.io/

Taesung Kwon, Jong Chul Ye• 2024

Related benchmarks

TaskDatasetResultRank
Video Restoration Efficiency25-frame video clip 1280 x 768 resolution
Time (s)13.6
5
Video Restoration (Problem B: Temporal blur + Spatial SRx8)Adobe240
FVMD128.2
5
Video Super-ResolutionPexels videos (initial 81 frames)
Latency (s)167
5
Video Super-ResolutionVideo Restoration Dataset
PSNR29.89
5
Video Gaussian DeblurVideo Restoration Dataset
PSNR31.1
4
Video InpaintingVideo Restoration Dataset
PSNR29.39
4
Video Restoration (Problem C: Temp. SRx8 + SRx8)GoPro 240
FVMD969.3
4
Video Restoration (Problem C: Temporal SRx8 + Spatial SRx8)Adobe240
FVMD1.65e+3
4
Video Spatio-Temporal AverageVideo Restoration Dataset
PSNR29.62
4
Video Temporal AverageVideo Restoration Dataset
PSNR30.46
4
Showing 10 of 15 rows

Other info

Follow for update