Fast Video Object Segmentation using the Global Context Module

About

We developed a real-time, high-quality semi-supervised video object segmentation algorithm. Its accuracy is on par with the most accurate, time-consuming online-learning model, while its speed is similar to the fastest template-matching method with sub-optimal accuracy. The core component of the model is a novel global context module that effectively summarizes and propagates information through the entire video. Compared to previous approaches that only use one frame or a few frames to guide the segmentation of the current frame, the global context module uses all past frames. Unlike the previous state-of-the-art space-time memory network that caches a memory at each spatio-temporal position, the global context module uses a fixed-size feature representation. Therefore, it uses constant memory regardless of the video length and costs substantially less memory and computation. With the novel module, our model achieves top performance on standard benchmarks at a real-time speed.

Yu Li, Zhuoran Shen, Ying Shan (1) __INSTITUTION_3__ Tencent PCG Applied Research Center, (2) The University of Hong Kong)• 2020

Related benchmarks

Task	Dataset	Result
Video Object Segmentation	DAVIS 2017 (val)	J mean69.3	1251
Video Object Segmentation	DAVIS 2016 (val)	J Mean87.6	564
Video Object Segmentation	YouTube-VOS 2018 (val)	J Score (Seen)72.6	493
Video Object Segmentation	YouTube-VOS 2019 (val)	J-Score (Seen)72.6	240
Semi-supervised Video Object Segmentation	DAVIS 2017 (val)	J&F Score71.4	55
Semi-supervised Video Object Segmentation	DAVIS 2016 (val)	Input J Score87.6	19

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord