Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HyperSeg: Patch-wise Hypernetwork for Real-time Semantic Segmentation

About

We present a novel, real-time, semantic segmentation network in which the encoder both encodes and generates the parameters (weights) of the decoder. Furthermore, to allow maximal adaptivity, the weights at each decoder block vary spatially. For this purpose, we design a new type of hypernetwork, composed of a nested U-Net for drawing higher level context features, a multi-headed weight generating module which generates the weights of each block in the decoder immediately before they are consumed, for efficient memory utilization, and a primary network that is composed of novel dynamic patch-wise convolutions. Despite the usage of less-conventional blocks, our architecture obtains real-time performance. In terms of the runtime vs. accuracy trade-off, we surpass state of the art (SotA) results on popular semantic segmentation benchmarks: PASCAL VOC 2012 (val. set) and real-time semantic segmentation on Cityscapes, and CamVid. The code is available: https://nirkin.com/hyperseg.

Yuval Nirkin, Lior Wolf, Tal Hassner• 2020

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU80.6
2040
Semantic segmentationCityscapes (test)
mIoU78.1
1145
Semantic segmentationCamVid (test)
mIoU79.1
411
Semantic segmentationCityscapes (val)
mIoU78.2
332
Referring SegmentationRefCOCO (val)
cIoU84.8
51
Referring SegmentationRefCOCO+ (val)
cIoU79
44
Semantic segmentationCityscapes (val)
mIoU78.2
38
Referring SegmentationRefCOCO (testB)
cIoU83.4
37
Referring SegmentationRefCOCO+ (testA)
cIoU0.835
30
Referring SegmentationRefCOCOg (val)
CIoU79.4
24
Showing 10 of 15 rows

Other info

Code

Follow for update