Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Pyramid Attention Network for Semantic Segmentation

About

A Pyramid Attention Network(PAN) is proposed to exploit the impact of global contextual information in semantic segmentation. Different from most existing works, we combine attention mechanism and spatial pyramid to extract precise dense features for pixel labeling instead of complicated dilated convolution and artificially designed decoder networks. Specifically, we introduce a Feature Pyramid Attention module to perform spatial pyramid attention structure on high-level output and combining global pooling to learn a better feature representation, and a Global Attention Upsample module on each decoder layer to provide global context as a guidance of low-level features to select category localization details. The proposed approach achieves state-of-the-art performance on PASCAL VOC 2012 and Cityscapes benchmarks with a new record of mIoU accuracy 84.0% on PASCAL VOC 2012, while training without COCO dataset.

Hanchao Li, Pengfei Xiong, Jie An, Lingxue Wang• 2018

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (test)
mIoU78.6
1145
Semantic segmentationSemanticKITTI (test)
mIoU44.4
335
Medical Image SegmentationSynapse (test)
Dice80.08
111
Semantic segmentationLoveDA
IoU (Background)43.04
60
Semantic segmentationEarthVLSet (test)
mIoU51.26
21
Hierarchical Semantic SegmentationCityscapes (val)
mIoU¹81.12
15
Medical Image SegmentationCHAOS
DSC84.34
14
Multi-organ SegmentationCHAOS MRI (3 folds)
DSC (Liver)92.13
7
Medical Image SegmentationHSVM
DSC82.26
7
Medical Image SegmentationHVSMR 2016 (5-fold cross-validation)
DSC (Myocardium)74.98
7
Showing 10 of 15 rows

Other info

Follow for update