Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UDC: Unified DNAS for Compressible TinyML Models

About

Deploying TinyML models on low-cost IoT hardware is very challenging, due to limited device memory capacity. Neural processing unit (NPU) hardware address the memory challenge by using model compression to exploit weight quantization and sparsity to fit more parameters in the same footprint. However, designing compressible neural networks (NNs) is challenging, as it expands the design space across which we must make balanced trade-offs. This paper demonstrates Unified DNAS for Compressible (UDC) NNs, which explores a large search space to generate state-of-the-art compressible NNs for NPU. ImageNet results show UDC networks are up to $3.35\times$ smaller (iso-accuracy) or 6.25% more accurate (iso-model size) than previous work.

Igor Fedorov, Ramon Matas, Hokchhay Tann, Chuteng Zhou, Matthew Mattina, Paul Whatmough• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationImageNet (val)
Top-1 Accuracy72.05
188
Image Super-resolutionSet14
PSNR27.98
115
Super-ResolutionDIV2K
PSNR29.79
101
Super-ResolutionSet5
PSNR31.31
82
Neural Architecture SearchImageNet (train)
GPU Days (GPUD)6.4
11
Showing 6 of 6 rows

Other info

Follow for update