Semantic Segmentation with Reverse Attention

About

Recent developments in fully convolutional neural networks enable efficient end-to-end learning of semantic segmentation. Traditionally, convolutional classifiers are trained to learn the representative semantic features of labeled semantic objects. In this work, we propose a reverse attention network (RAN) architecture that trains the network to capture the opposite concept as well (i.e., what is not associated with a target class). The RAN is a three-branch network that performs the direct, reverse and reverse-attention learning processes simultaneously. Extensive experiments are conducted to show the effectiveness of the RAN in semantic segmentation. Built upon DeepLabv2-LargeFOV, the RAN achieves a state-of-the-art mIoU score (48.1%) on the challenging PASCAL-Context dataset. Significant performance improvements are also observed on the PASCAL-VOC, Person-Part, NYUDv2 and ADE20K datasets.
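The abstract names the three branches but does not say how their outputs are merged. The sketch below is one plausible reading, not the authors' implementation: for a single pixel, the reverse-attention weight is assumed to be a sigmoid of the negated direct score, and the weighted reverse response is subtracted from the direct response. The function names `sigmoid` and `combine_branches` and the example scores are hypothetical.

```python
import math


def sigmoid(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def combine_branches(direct, reverse):
    """Merge per-class scores of one pixel from the direct and reverse branches.

    ASSUMPTION: the attention weight is sigmoid(-direct score), so it is large
    where the direct branch responds weakly, and the weighted reverse score is
    subtracted from the direct score. The abstract does not state this rule.
    """
    attention = [sigmoid(-d) for d in direct]
    return [d - a * r for d, a, r in zip(direct, attention, reverse)]


# Hypothetical scores for two classes at one pixel.
direct_scores = [2.0, 0.0]   # direct branch: evidence for each class
reverse_scores = [1.0, 1.0]  # reverse branch: evidence against each class
combined = combine_branches(direct_scores, reverse_scores)
```

Under this assumption, a class the direct branch is already confident about is barely changed, while a class with a weak direct response is penalised more strongly by the reverse branch.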

Qin Huang, Chunyang Xia, Chihao Wu, Siyang Li, Ye Wang, Yuhang Song, C.-C. Jay Kuo • 2017

Related benchmarks

Task	Dataset	Result
Semantic segmentation	ADE20K	mIoU 35.3	1028
Semantic segmentation	Pascal Context (test)	mIoU 48.1	223
Semantic segmentation	NYU Depth V2 (test)	mIoU 41.2	183
Semantic Part Segmentation	PASCAL-Person-Part (val)	mIoU 66.6	17
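
The results above are reported as mIoU (mean intersection-over-union). As a minimal sketch of how that metric is commonly computed, assuming flat lists of integer class labels and averaging only over classes that occur in the prediction or the ground truth; the function name `mean_iou` and the toy labels are illustrative, and each benchmark's official protocol (ignored labels, accumulation over the whole dataset) may differ:

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes seen in pred or target."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if p == t:
            # Pixel counts once toward both intersection and union of its class.
            intersection[p] += 1
            union[p] += 1
        else:
            # A mismatch enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six pixels, three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
```

Here the per-class IoUs are 1/3, 2/3 and 1/2, so `score` is 0.5, which would be reported as 50.0 on the percentage scale used in the table.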
