Deep Video Inpainting

About

Video inpainting aims to fill spatio-temporal holes with plausible content in a video. Despite tremendous progress of deep neural networks for image inpainting, it is challenging to extend these methods to the video domain due to the additional time dimension. In this work, we propose a novel deep network architecture for fast video inpainting. Built upon an image-based encoder-decoder model, our framework is designed to collect and refine information from neighbor frames and synthesize still-unknown regions. At the same time, the output is enforced to be temporally consistent by a recurrent feedback and a temporal memory module. Compared with the state-of-the-art image inpainting algorithm, our method produces videos that are much more semantically correct and temporally smooth. In contrast to the prior video completion method which relies on time-consuming optimization, our method runs in near real-time while generating competitive video results. Finally, we applied our framework to video retargeting task, and obtain visually pleasing results.

Dahun Kim, Sanghyun Woo, Joon-Young Lee, In So Kweon• 2019

Related benchmarks

Task	Dataset	Result
Video Inpainting	DAVIS (test)	PSNR28.96	54
Video Inpainting	DAVIS square mask (test)	PSNR28.32	14
Video Inpainting	Youtube-VOS square mask (test)	PSNR29.83	14
Video Inpainting	DAVIS object mask (test)	PSNR28.47	14
Video Completion	DAVIS	Ewarp0.1785	11
Video Inpainting	DAVIS	PSNR28.96	10
offline video inpainting	YouTube-VOS (test)	PSNR29.2	10
Video Completion (Object Masks)	DAVIS 29-sequence 2017 (test)	PSNR28.07	10
Video Completion (Stationary Masks)	DAVIS 90-sequence 2017 (train val)	PSNR25.19	10
Video Completion	Youtube-VOS	Ewarp0.149	8

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord