Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

About

The ability to perform pixel-wise semantic segmentation in real-time is of paramount importance in mobile applications. Recent deep neural networks aimed at this task have the disadvantage of requiring a large number of floating point operations and have long run-times that hinder their usability. In this paper, we propose a novel deep neural network architecture named ENet (efficient neural network), created specifically for tasks requiring low latency operation. ENet is up to 18$\times$ faster, requires 75$\times$ less FLOPs, has 79$\times$ less parameters, and provides similar or better accuracy to existing models. We have tested it on CamVid, Cityscapes and SUN datasets and report on comparisons with existing state-of-the-art methods, and the trade-offs between accuracy and processing time of a network. We present performance measurements of the proposed architecture on embedded systems and suggest possible software improvements that could make ENet even faster.

Adam Paszke, Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello• 2016

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (test)
mIoU80.4
1145
Semantic segmentationCityscapes (val)
mIoU58.3
572
Semantic segmentationCamVid (test)
mIoU68.3
411
Semantic segmentationCityscapes (val)
mIoU58.3
332
Semantic segmentationCityscapes (val)
mIoU58.3
287
Lane DetectionTuSimple (test)
Accuracy93.02
250
Semantic segmentationSUN RGB-D (test)
mIoU0.197
191
Semantic segmentationCityscapes (val)
mIoU58.3
108
Semantic segmentationTrans10K v2 (test)
mIoU8.5
104
Semantic segmentationMapillary Vistas (val)
mIoU47
72
Showing 10 of 28 rows

Other info

Follow for update