Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fast Video Object Segmentation using the Global Context Module

About

We developed a real-time, high-quality semi-supervised video object segmentation algorithm. Its accuracy is on par with the most accurate, time-consuming online-learning model, while its speed is similar to the fastest template-matching method with sub-optimal accuracy. The core component of the model is a novel global context module that effectively summarizes and propagates information through the entire video. Compared to previous approaches that only use one frame or a few frames to guide the segmentation of the current frame, the global context module uses all past frames. Unlike the previous state-of-the-art space-time memory network that caches a memory at each spatio-temporal position, the global context module uses a fixed-size feature representation. Therefore, it uses constant memory regardless of the video length and costs substantially less memory and computation. With the novel module, our model achieves top performance on standard benchmarks at a real-time speed.

Yu Li, Zhuoran Shen, Ying Shan (1) __INSTITUTION_3__ Tencent PCG Applied Research Center, (2) The University of Hong Kong)• 2020

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean69.3
1130
Video Object SegmentationDAVIS 2016 (val)
J Mean87.6
564
Video Object SegmentationYouTube-VOS 2018 (val)
J Score (Seen)72.6
493
Video Object SegmentationYouTube-VOS 2019 (val)
J-Score (Seen)72.6
231
Semi-supervised Video Object SegmentationDAVIS 2017 (val)
J&F Score71.4
31
Semi-supervised Video Object SegmentationDAVIS 2016 (val)
Input J Score87.6
19
Showing 6 of 6 rows

Other info

Follow for update