Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Full-Duplex Strategy for Video Object Segmentation

About

Previous video object segmentation approaches mainly focus on using simplex solutions between appearance and motion, limiting feature collaboration efficiency among and across these two cues. In this work, we study a novel and efficient full-duplex strategy network (FSNet) to address this issue, by considering a better mutual restraint scheme between motion and appearance in exploiting the cross-modal features from the fusion and decoding stage. Specifically, we introduce the relational cross-attention module (RCAM) to achieve bidirectional message propagation across embedding sub-spaces. To improve the model's robustness and update the inconsistent features from the spatial-temporal embeddings, we adopt the bidirectional purification module (BPM) after the RCAM. Extensive experiments on five popular benchmarks show that our FSNet is robust to various challenging scenarios (e.g., motion blur, occlusion) and achieves favourable performance against existing cutting-edges both in the video object segmentation and video salient object detection tasks. The project is publicly available at: https://dpfan.net/FSNet.

Ge-Peng Ji, Deng-Ping Fan, Keren Fu, Zhe Wu, Jianbing Shen, Ling Shao• 2021

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2016 (val)
J Mean83.4
564
Unsupervised Video Object SegmentationDAVIS 2016 (val)
F Mean83.1
108
Video Salient Object DetectionDAVIS 16 (val)
MAE2
39
Video Salient Object DetectionDAVSOD (test)
Sa77.3
32
Video Salient Object DetectionFBMS
F-beta Score (Fβ)0.888
31
Video Salient Object DetectionFBMS (test)
F-score88.8
30
Video Polyp SegmentationSUN-SEG Easy (test)
Dice70.2
28
Video Polyp SegmentationSUN-SEG Hard (test)
Dice0.699
28
Video Polyp SegmentationSUN-SEG Hard (Unseen)
S Alpha Score72.4
27
Video Polyp SegmentationSUN-SEG Easy (Unseen)
S-alpha Score0.725
27
Showing 10 of 22 rows

Other info

Follow for update