Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats

About

Quantization of the weights and activations is one of the main methods to reduce the computational footprint of Deep Neural Networks (DNNs) training. Current methods enable 4-bit quantization of the forward phase. However, this constitutes only a third of the training process. Reducing the computational footprint of the entire training process requires the quantization of the neural gradients, i.e., the loss gradients with respect to the outputs of intermediate neural layers. Previous works separately showed that accurate 4-bit quantization of the neural gradients needs to (1) be unbiased and (2) have a log scale. However, no previous work aimed to combine both ideas, as we do in this work. Specifically, we examine the importance of having unbiased quantization in quantized neural network training, where to maintain it, and how to combine it with logarithmic quantization. Based on this, we suggest a $\textit{logarithmic unbiased quantization}$ (LUQ) method to quantize both the forward and backward phases to 4-bit, achieving state-of-the-art results in 4-bit training without the overhead. For example, in ResNet50 on ImageNet, we achieved a degradation of 1.1%. We further improve this to a degradation of only 0.32% after three epochs of high precision fine-tuning, combined with a variance reduction method -- where both these methods add overhead comparable to previously suggested methods.
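To make the combination of "unbiased" and "log scale" concrete, here is a minimal sketch of stochastic rounding onto a power-of-two grid. It is not the authors' implementation: the function name `log_unbiased_quantize`, the parameters `alpha` (smallest nonzero magnitude) and `num_levels` (number of nonzero magnitudes, 7 here on the assumption of one sign bit and three exponent bits), and the clipping behavior at the top level are all assumptions made for illustration. The sketch rounds each value to one of its two neighboring grid points with probabilities chosen so that the expected output equals the input, and treats values below `alpha` the same way with 0 as the lower neighbor.

```python
import math
import random


def log_unbiased_quantize(x, alpha, num_levels=7, rng=random):
    """Stochastically round x onto {0} U {+-alpha * 2**k, k = 0..num_levels-1}.

    Hypothetical helper for illustration; alpha and num_levels are assumed
    parameters, not values taken from the paper.
    """
    if x == 0.0:
        return 0.0
    sign = 1.0 if x > 0 else -1.0
    mag = abs(x)
    top = alpha * 2.0 ** (num_levels - 1)

    if mag >= top:
        # Clipping: biased for mag > top, so alpha should be chosen
        # such that the largest input magnitude fits in the grid.
        return sign * top

    if mag < alpha:
        # Below the smallest level: alpha with probability mag / alpha,
        # otherwise 0, so the expectation is mag.
        return sign * alpha if rng.random() < mag / alpha else 0.0

    # mag / alpha = m * 2**e with 0.5 <= m < 1, so the lower level is 2**(e-1).
    m, e = math.frexp(mag / alpha)
    low = alpha * 2.0 ** (e - 1)
    # Round up to 2 * low with probability (mag - low) / low = 2m - 1,
    # which gives E[output] = low * (1 + p_up) = mag.
    p_up = 2.0 * m - 1.0
    return sign * (2.0 * low if rng.random() < p_up else low)


if __name__ == "__main__":
    rng = random.Random(0)
    x, alpha = 0.3, 0.125
    samples = [log_unbiased_quantize(x, alpha, rng=rng) for _ in range(100_000)]
    # Every sample is 0.25 or 0.5; their mean should be close to 0.3.
    print(sorted(set(samples)), sum(samples) / len(samples))
```

Each individual output is a coarse power-of-two value, but averaging many quantized copies of the same input recovers that input, which is the sense in which the rounding is unbiased. A deterministic round-to-nearest on the same grid would not have this property.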

Brian Chmiel, Ron Banner, Elad Hoffer, Hilla Ben Yaacov, Daniel Soudry • 2021

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Classification | CIFAR10 (test) | Accuracy 94.74 | 585 |
| Semantic segmentation | Cityscapes | mIoU 70.31 | 578 |
| Image Classification | CIFAR100 (test) | Accuracy 76.26 | 206 |
| Image Classification | ImageNet-100 (val) | Top-1 Accuracy 83.13 | 95 |
| Classification | CIFAR100 | Accuracy 91.76 | 66 |
| Image Classification | ImageNet 1k (train) | Top-1 Accuracy 67.1 | 58 |
| Classification | CIFAR10 | Top-1 Accuracy 97.86 | 38 |
| Semantic segmentation | VOC 2012 | mIoU 78.7 | 18 |
| Classification | MRPC | Accuracy 84.56 | 6 |
| Image Classification | ImageNet-100 (pre-training) | Accuracy 75.97 | 4 |