
DiffuEraser: A Diffusion Model for Video Inpainting

About

Recent video inpainting algorithms integrate flow-based pixel propagation with transformer-based generation: optical flow is used to restore textures and objects from neighboring frames, while visual Transformers complete the masked regions. However, these approaches often produce blurring and temporal inconsistencies when dealing with large masks, highlighting the need for models with stronger generative capabilities. Recently, diffusion models have emerged as a prominent technique in image and video generation due to their impressive performance. In this paper, we introduce DiffuEraser, a video inpainting model based on Stable Diffusion, designed to fill masked regions with greater detail and more coherent structures. We incorporate prior information to provide initialization and weak conditioning, which helps mitigate noisy artifacts and suppress hallucinations. Additionally, to improve temporal consistency during long-sequence inference, we expand the temporal receptive fields of both the prior model and DiffuEraser, and further enhance consistency by leveraging the temporal smoothing property of Video Diffusion Models. Experimental results demonstrate that our proposed method outperforms state-of-the-art techniques in both content completeness and temporal consistency while maintaining acceptable efficiency.

Xiaowen Li, Haolan Xue, Peiran Ren, Liefeng Bo • 2025
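
The abstract mentions using prior information as an initialization for the diffusion process but does not say how. One common way to do this in diffusion models, shown here as a minimal sketch and not as the authors' confirmed procedure, is to forward-diffuse the prior's latent to a chosen noise level with the standard DDPM formula instead of starting from pure Gaussian noise. The function name `noise_from_prior`, the latent shape, and the value of `alpha_bar_t` are all illustrative assumptions.

```python
import numpy as np

def noise_from_prior(prior_latent, alpha_bar_t, rng):
    """Forward-diffuse a prior latent to a given noise level.

    Standard DDPM forward process:
        x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps
    Here x_0 is the (hypothetical) latent of a prior model's output.
    """
    eps = rng.standard_normal(prior_latent.shape)
    return np.sqrt(alpha_bar_t) * prior_latent + np.sqrt(1.0 - alpha_bar_t) * eps

rng = np.random.default_rng(0)
# Hypothetical latent video: 4 frames of 8x8 (shape chosen for illustration only).
prior = rng.standard_normal((4, 8, 8))
# A small alpha_bar_t keeps only a weak trace of the prior in the starting point.
x_t = noise_from_prior(prior, alpha_bar_t=0.01, rng=rng)
```

With `alpha_bar_t` close to 0 the starting point is almost pure noise, and with `alpha_bar_t` close to 1 it is almost exactly the prior, so the parameter controls how strongly the prior steers the result.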

Related benchmarks

| Task | Dataset | Result | Rank |
| Video Object Removal | Real-World Videos | Internal Physics Score: 2.19 | 21 |
| Video Object Removal | Scene-Bench | Removal Completeness: 4.0455 | 16 |
| Video Object Removal | ROSE Bench | LPIPS: 0.0885 | 13 |
| Video Editing | VIE-Bench | Instruction Following: 6.346 | 11 |
| Video Object Removal | DAVIS | mPSNR: 33.76 | 9 |
| Video Object Removal | CAMERA-Bench | PSNR: 26.7892 | 8 |
| Video Object Removal | VOR-Wild without GT | QScore: 9.113 | 8 |
| Video Object Removal | VOR-Eval with GT | PSNR: 21.946 | 8 |
| Video Object Removal | ROSE-Benchmark with GT | PSNR: 26.502 | 8 |
| Video Object Removal | Synthetic (Kubric + HUMOTO) (test) | PSNR: 30.11 | 7 |
Showing 10 of 20 rows
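
Several of the results above are reported as PSNR. As a reference for reading those numbers, here is a minimal sketch of the standard PSNR formula in decibels, with an optional mask to restrict the error to a region. How each benchmark defines its own variant (including mPSNR) is not stated on this page, so the `mask` argument and the 8-bit `max_value` default are assumptions for illustration.

```python
import numpy as np

def psnr(reference, estimate, max_value=255.0, mask=None):
    """Peak signal-to-noise ratio in dB: 10 * log10(max_value^2 / MSE).

    If `mask` is given, only pixels where the mask is True contribute
    to the mean squared error (an illustrative option, not a benchmark spec).
    """
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    sq_err = (reference - estimate) ** 2
    if mask is not None:
        sq_err = sq_err[np.asarray(mask, dtype=bool)]
    mse = sq_err.mean()
    if mse == 0:
        return float("inf")  # identical inputs
    return 10.0 * np.log10(max_value ** 2 / mse)

# Toy frames: a uniform error of 25.5 on an 8-bit scale gives about 20 dB.
ref = np.zeros((4, 4))
est = np.full((4, 4), 25.5)
score = psnr(ref, est)
```

Higher PSNR means the output is closer to the ground truth, which is why PSNR rows only appear for datasets that provide one.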
