MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

About

We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depthwise separable convolutions to build lightweight deep neural networks. We introduce two simple global hyper-parameters that efficiently trade off between latency and accuracy. These hyper-parameters allow the model builder to choose the right-sized model for their application based on the constraints of the problem. We present extensive experiments on resource and accuracy tradeoffs and show strong performance compared to other popular models on ImageNet classification. We then demonstrate the effectiveness of MobileNets across a wide range of applications and use cases, including object detection, fine-grained classification, face attributes, and large-scale geo-localization.

Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam • 2017
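
The saving from depthwise separable convolutions, and the effect of the two global hyper-parameters mentioned in the abstract, can be illustrated with a rough multiply-add count. This is a minimal sketch rather than the authors' code: the function names are hypothetical, the names "width multiplier" (alpha) and "resolution multiplier" (rho) come from the full paper rather than from this page, and truncating scaled sizes with int() is an assumption made here for simplicity.

```python
def standard_conv_mult_adds(kernel, in_ch, out_ch, fmap):
    """Multiply-adds of a standard kernel x kernel convolution
    producing a square fmap x fmap output (stride 1, with padding)."""
    return kernel * kernel * in_ch * out_ch * fmap * fmap


def separable_conv_mult_adds(kernel, in_ch, out_ch, fmap, alpha=1.0, rho=1.0):
    """Multiply-adds of a depthwise separable convolution.

    alpha (assumed name: width multiplier) scales the channel counts;
    rho (assumed name: resolution multiplier) scales the feature-map side.
    """
    m = int(alpha * in_ch)    # thinned input channels
    n = int(alpha * out_ch)   # thinned output channels
    f = int(rho * fmap)       # reduced feature-map side
    depthwise = kernel * kernel * m * f * f  # one spatial filter per input channel
    pointwise = m * n * f * f                # 1x1 convolution that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Illustrative layer shape: 3x3 kernel, 512 -> 512 channels, 14x14 feature map.
    std = standard_conv_mult_adds(3, 512, 512, 14)
    sep = separable_conv_mult_adds(3, 512, 512, 14)
    thin = separable_conv_mult_adds(3, 512, 512, 14, alpha=0.5)
    print(f"standard:             {std:,}")
    print(f"separable:            {sep:,}")
    print(f"separable, alpha=0.5: {thin:,}")
    print(f"separable / standard: {sep / std:.4f}")
```

For a 3x3 kernel the ratio of separable to standard cost is 1/out_ch + 1/9, so the separable layer needs roughly 8 to 9 times fewer multiply-adds; lowering alpha or rho shrinks the cost further, approximately quadratically in each.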

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-100 (test)	--	3518
Image Classification	CIFAR-10 (test)	--	3381
Object Detection	COCO 2017 (val)	AP 31	2930
Image Classification	ImageNet-1k (val)	Top-1 Accuracy 70.6	1498
Image Classification	ImageNet-1K	Top-1 Acc 70.6	1239
Object Detection	COCO (test-dev)	mAP 19.3	1239
Image Classification	ImageNet (val)	Top-1 Acc 70.8	1206
Classification	ImageNet-1K 1.0 (val)	Top-1 Accuracy (%) 61.7	1171
Image Classification	ImageNet-1k (val)	Top-1 Accuracy 70.9	960
Image Classification	ImageNet 1k (test)	--	939

Showing 10 of 108 rows
