Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Light-Weight RefineNet for Real-Time Semantic Segmentation

About

We consider an important task of effective and efficient semantic image segmentation. In particular, we adapt a powerful semantic segmentation architecture, called RefineNet, into the more compact one, suitable even for tasks requiring real-time performance on high-resolution inputs. To this end, we identify computationally expensive blocks in the original setup, and propose two modifications aimed to decrease the number of parameters and floating point operations. By doing that, we achieve more than twofold model reduction, while keeping the performance levels almost intact. Our fastest model undergoes a significant speed-up boost from 20 FPS to 55 FPS on a generic GPU card on 512x512 inputs with solid 81.1% mean iou performance on the test set of PASCAL VOC, while our slowest model with 32 FPS (from original 17 FPS) shows 82.7% mean iou on the same dataset. Alternatively, we showcase that our approach is easily mixable with light-weight classification networks: we attain 79.2% mean iou on PASCAL VOC using a model that contains only 3.3M parameters and performs only 9.3B floating point operations.

Vladimir Nekrasov, Chunhua Shen, Ian Reid• 2018

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC (val)
mIoU82.1
338
Semantic segmentationNYU v2 (test)
mIoU44.4
248
Semantic segmentationPascal VOC (test)
mIoU82.7
236
Semantic segmentationPascal Context (test)
mIoU45.8
176
Semantic segmentationNYU Depth V2 (test)
mIoU43.6
172
Semantic segmentationFoggy Driving (FD) (test)
mIoU43.6
56
Semantic segmentationNYU Depth V2--
26
Semantic segmentationFoggy Driving-dense (FDD) (test)
mIoU35.9
25
Semantic segmentationPASCAL-Person-Part (test)
mIoU67.6
16
Semantic segmentationFoggy Zurich (FZ) v2 (test)
mIoU28.5
16
Showing 10 of 12 rows

Other info

Follow for update