Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Video Object Segmentation from Unlabeled Videos

About

We propose a new method for video object segmentation (VOS) that addresses object pattern learning from unlabeled videos, unlike most existing methods which rely heavily on extensive annotated data. We introduce a unified unsupervised/weakly supervised learning framework, called MuG, that comprehensively captures intrinsic properties of VOS at multiple granularities. Our approach can help advance understanding of visual patterns in VOS and significantly reduce annotation burden. With a carefully-designed architecture and strong representation learning ability, our learned model can be applied to diverse VOS settings, including object-level zero-shot VOS, instance-level zero-shot VOS, and one-shot VOS. Experiments demonstrate promising performance in these settings, as well as the potential of MuG in leveraging unlabeled data to further improve the segmentation accuracy.

Xiankai Lu, Wenguan Wang, Jianbing Shen, Yu-Wing Tai, David Crandall, Steven C. H. Hoi• 2020

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean52.6
1130
Video Object SegmentationYouTube-VOS 2018 (val)
J Score (Seen)71
493
One-shot Video Object SegmentationDAVIS 2016 (val)
J Mean65.7
28
Video Object SegmentationDAVIS (val)--
28
Zero-shot Video Object SegmentationDAVIS 2016 (val)
J Mean61.2
25
Video Object SegmentationYouTube-Objects (full)
J Score62.4
18
One-shot Video Object SegmentationDAVIS 2017 (val)
J&F Mean56.1
11
Zero-shot Video Object SegmentationDAVIS 2017 (test-dev)
Jaccard Mean38.9
9
Showing 8 of 8 rows

Other info

Code

Follow for update