Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MobileOne: An Improved One millisecond Mobile Backbone

About

Efficient neural network backbones for mobile devices are often optimized for metrics such as FLOPs or parameter count. However, these metrics may not correlate well with latency of the network when deployed on a mobile device. Therefore, we perform extensive analysis of different metrics by deploying several mobile-friendly networks on a mobile device. We identify and analyze architectural and optimization bottlenecks in recent efficient neural networks and provide ways to mitigate these bottlenecks. To this end, we design an efficient backbone MobileOne, with variants achieving an inference time under 1 ms on an iPhone12 with 75.9% top-1 accuracy on ImageNet. We show that MobileOne achieves state-of-the-art performance within the efficient architectures while being many times faster on mobile. Our best model obtains similar performance on ImageNet as MobileFormer while being 38x faster. Our model obtains 2.3% better top-1 accuracy on ImageNet than EfficientNet at similar latency. Furthermore, we show that our model generalizes to multiple tasks - image classification, object detection, and semantic segmentation with significant improvements in latency and accuracy as compared to existing efficient architectures when deployed on a mobile device. Code and models are available at https://github.com/apple/ml-mobileone

Pavan Kumar Anasosalu Vasu, James Gabriel, Jeff Zhu, Oncel Tuzel, Anurag Ranjan• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU38.2
2731
Image ClassificationImageNet A
Top-1 Acc10.8
553
Image ClassificationImageNet-1K
Top-1 Acc79.4
524
Image ClassificationImageNet-R
Top-1 Acc44.8
474
Image ClassificationImageNet-Sketch
Top-1 Accuracy29.2
360
Image ClassificationImageNet-1k (val)
Top-1 Acc69
287
Semantic segmentationPascal VOC (test)
mIoU80.1
236
Image ClassificationImageNet-1k 1.0 (test)
Top-1 Accuracy0.794
191
Image ClassificationImageNet-1k (val)
Top-1 Acc79.4
188
Image ClassificationImageNet (val)
Top-1 Accuracy79.4
188
Showing 10 of 19 rows

Other info

Code

Follow for update