Solving Video Inverse Problems Using Image Diffusion Models

About

Recently, diffusion model-based inverse problem solvers (DIS) have emerged as state-of-the-art approaches for addressing inverse problems, including image super-resolution, deblurring, inpainting, etc. However, their application to video inverse problems arising from spatio-temporal degradation remains largely unexplored due to the challenges in training video diffusion models. To address this issue, here we introduce an innovative video inverse solver that leverages only image diffusion models. Specifically, by drawing inspiration from the success of the recent decomposed diffusion sampler (DDS), our method treats the time dimension of a video as the batch dimension of image diffusion models and solves spatio-temporal optimization problems within denoised spatio-temporal batches derived from each image diffusion model. Moreover, we introduce a batch-consistent diffusion sampling strategy that encourages consistency across batches by synchronizing the stochastic noise components in image diffusion models. Our approach synergistically combines batch-consistent sampling with simultaneous optimization of denoised spatio-temporal batches at each reverse diffusion step, resulting in a novel and efficient diffusion sampling strategy for video inverse problems. Experimental results demonstrate that our method effectively addresses various spatio-temporal degradations in video inverse problems, achieving state-of-the-art reconstructions. Project page: https://svi-diffusion.github.io/

Taesung Kwon, Jong Chul Ye• 2024

Related benchmarks

Task	Dataset	Result
Video Restoration Efficiency	25-frame video clip 1280 x 768 resolution	Time (s)13.6	5
Video Restoration (Problem B: Temporal blur + Spatial SRx8)	Adobe240	FVMD128.2	5
Video Super-Resolution	Pexels videos (initial 81 frames)	Latency (s)167	5
Video Super-Resolution	Video Restoration Dataset	PSNR29.89	5
Video Gaussian Deblur	Video Restoration Dataset	PSNR31.1	4
Video Inpainting	Video Restoration Dataset	PSNR29.39	4
Video Restoration (Problem C: Temp. SRx8 + SRx8)	GoPro 240	FVMD969.3	4
Video Restoration (Problem C: Temporal SRx8 + Spatial SRx8)	Adobe240	FVMD1.65e+3	4
Video Spatio-Temporal Average	Video Restoration Dataset	PSNR29.62	4
Video Temporal Average	Video Restoration Dataset	PSNR30.46	4

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord