Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Efficient Video Object Segmentation via Network Modulation

About

Video object segmentation targets at segmenting a specific object throughout a video sequence, given only an annotated first frame. Recent deep learning based approaches find it effective by fine-tuning a general-purpose segmentation model on the annotated frame using hundreds of iterations of gradient descent. Despite the high accuracy these methods achieve, the fine-tuning process is inefficient and fail to meet the requirements of real world applications. We propose a novel approach that uses a single forward pass to adapt the segmentation model to the appearance of a specific object. Specifically, a second meta neural network named modulator is learned to manipulate the intermediate layers of the segmentation network given limited visual and spatial information of the target object. The experiments show that our approach is 70times faster than fine-tuning approaches while achieving similar accuracy.

Linjie Yang, Yanran Wang, Xuehan Xiong, Jianchao Yang, Aggelos K. Katsaggelos• 2018

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean55.1
1130
Video Instance SegmentationYouTube-VIS 2019 (val)
AP27.5
567
Video Object SegmentationDAVIS 2016 (val)
J Mean74
564
Video Object SegmentationYouTube-VOS 2018 (val)
J Score (Seen)60
493
Video Object SegmentationDAVIS 2017 (test-dev)
Region J Mean37.7
237
Video Instance SegmentationYouTube-VIS (val)
AP27.5
118
Video Object SegmentationDAVIS 2017 (test)
J (Jaccard Index)37.7
107
Video Object SegmentationYouTube-VOS (val)
J Score (Seen)60
81
Video Object SegmentationYouTube-Objects
mIoU69
50
Video Object SegmentationDAVIS 2016--
44
Showing 10 of 20 rows

Other info

Code

Follow for update