Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Progressively Normalized Self-Attention Network for Video Polyp Segmentation

About

Existing video polyp segmentation (VPS) models typically employ convolutional neural networks (CNNs) to extract features. However, due to their limited receptive fields, CNNs can not fully exploit the global temporal and spatial information in successive video frames, resulting in false-positive segmentation results. In this paper, we propose the novel PNS-Net (Progressively Normalized Self-attention Network), which can efficiently learn representations from polyp videos with real-time speed (~140fps) on a single RTX 2080 GPU and no post-processing. Our PNS-Net is based solely on a basic normalized self-attention block, equipping with recurrence and CNNs entirely. Experiments on challenging VPS datasets demonstrate that the proposed PNS-Net achieves state-of-the-art performance. We also conduct extensive experiments to study the effectiveness of the channel split, soft-attention, and progressive learning strategy. We find that our PNS-Net works well under different settings, making it a promising solution to the VPS task.

Ge-Peng Ji, Yu-Cheng Chou, Deng-Ping Fan, Geng Chen, Huazhu Fu, Debesh Jha, Ling Shao• 2021

Related benchmarks

TaskDatasetResultRank
Video Camouflaged Object DetectionCAD (test)
Fw36.9
37
Video Polyp SegmentationSUN-SEG Hard (test)
Dice0.675
28
Video Polyp SegmentationSUN-SEG Easy (test)
Dice67.6
28
Video Polyp SegmentationSUN-SEG Easy (Unseen)
S-alpha Score76.7
27
Video Polyp SegmentationSUN-SEG Hard (Unseen)
S Alpha Score76.7
27
Video Polyp SegmentationCVC-300-TV (test)
92.1
21
Video Polyp SegmentationSUN-SEG Easy (Seen)
S_alpha90.6
21
Polyp Video SegmentationSUN-SEG Seen-Hard
S_alpha87
12
Camouflaged Object DetectionCAD (full)
Sa65.5
10
Video Camouflaged Object DetectionMoCA-Mask w/ pseudo labels (test)
S-alpha0.576
10
Showing 10 of 19 rows

Other info

Code

Follow for update