Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Fully Dense Neural Networks for Image Semantic Segmentation

About

Semantic segmentation is pixel-wise classification which retains critical spatial information. The "feature map reuse" has been commonly adopted in CNN based approaches to take advantage of feature maps in the early layers for the later spatial reconstruction. Along this direction, we go a step further by proposing a fully dense neural network with an encoder-decoder structure that we abbreviate as FDNet. For each stage in the decoder module, feature maps of all the previous blocks are adaptively aggregated to feed-forward as input. On the one hand, it reconstructs the spatial boundaries accurately. On the other hand, it learns more efficiently with the more efficient gradient backpropagation. In addition, we propose the boundary-aware loss function to focus more attention on the pixels near the boundary, which boosts the "hard examples" labeling. We have demonstrated the best performance of the FDNet on the two benchmark datasets: PASCAL VOC 2012, NYUDv2 over previous works when not considering training on other datasets.

Mingmin Zhen, Jinglu Wang, Lei Zhou, Tian Fang, Long Quan• 2019

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU84.1
2040
Semantic segmentationPASCAL VOC 2012 (test)
mIoU84.2
1342
Semantic segmentationNYU v2 (test)
mIoU47.4
248
Semantic segmentationNYUDv2 40-class (test)
mIoU47.4
99
Showing 4 of 4 rows

Other info

Follow for update