Fast and Accurate Online Video Object Segmentation via Tracking Parts

About

Online video object segmentation is a challenging task as it entails to process the image sequence timely and accurately. To segment a target object through the video, numerous CNN-based methods have been developed by heavily finetuning on the object mask in the first frame, which is time-consuming for online applications. In this paper, we propose a fast and accurate video object segmentation algorithm that can immediately start the segmentation process once receiving the images. We first utilize a part-based tracking method to deal with challenging factors such as large deformation, occlusion, and cluttered background. Based on the tracked bounding boxes of parts, we construct a region-of-interest segmentation network to generate part masks. Finally, a similarity-based scoring function is adopted to refine these object parts by comparing them to the visual information in the first frame. Our method performs favorably against state-of-the-art algorithms in accuracy on the DAVIS benchmark dataset, while achieving much faster runtime performance.

Jingchun Cheng, Yi-Hsuan Tsai, Wei-Chih Hung, Shengjin Wang, Ming-Hsuan Yang• 2018

Related benchmarks

Task	Dataset	Result
Video Object Segmentation	DAVIS 2017 (val)	J mean54.6	1226
Video Object Segmentation	DAVIS 2016 (val)	J Mean82.4	564
Video Object Segmentation	DAVIS 2017 (test-dev)	Region J Mean42.9	237
Semi-supervised Video Object Segmentation	DAVIS 2016 (val)	Input J Score82.4	19
Video Object Segmentation	DAVIS Challenge 2019 (val)	J&F Mean58.2	8
Video Object Segmentation	DAVIS Challenge 2019 (test-dev)	J&F Mean43.6	7

Showing 6 of 6 rows

Other info

Code

Follow for update

@wizwand_team Discord