Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

About

We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build light weight deep neural networks. We introduce two simple global hyper-parameters that efficiently trade off between latency and accuracy. These hyper-parameters allow the model builder to choose the right sized model for their application based on the constraints of the problem. We present extensive experiments on resource and accuracy tradeoffs and show strong performance compared to other popular models on ImageNet classification. We then demonstrate the effectiveness of MobileNets across a wide range of applications and use cases including object detection, finegrain classification, face attributes and large scale geo-localization.

Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam• 2017

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationCIFAR-10 (test)--
3381
Object DetectionCOCO 2017 (val)
AP31
2643
Image ClassificationImageNet-1k (val)
Top-1 Accuracy70.6
1469
Image ClassificationImageNet-1K
Top-1 Acc70.6
1239
Object DetectionCOCO (test-dev)
mAP19.3
1239
Image ClassificationImageNet (val)
Top-1 Acc70.8
1206
ClassificationImageNet-1K 1.0 (val)
Top-1 Accuracy (%)61.7
1163
Image ClassificationCIFAR-10 (test)--
906
Image ClassificationImageNet 1k (test)--
848
Showing 10 of 95 rows
...

Other info

Follow for update