Quantization without Tears

About

Deep neural networks, while achieving remarkable success across diverse tasks, demand significant resources, including computation, GPU memory, bandwidth, storage, and energy. Network quantization, as a standard compression and acceleration technique, reduces storage costs and enables potential inference acceleration by discretizing network weights and activations into a finite set of integer values. However, current quantization methods are often complex and sensitive, requiring extensive task-specific hyperparameters, where even a single misconfiguration can impair model performance, limiting generality across different models and tasks. In this paper, we propose Quantization without Tears (QwT), a method that simultaneously achieves quantization speed, accuracy, simplicity, and generality. The key insight of QwT is to incorporate a lightweight additional structure into the quantized network to mitigate information loss during quantization. This structure consists solely of a small set of linear layers, keeping the method simple and efficient. More importantly, it provides a closed-form solution, allowing us to improve accuracy effortlessly under 2 minutes. Extensive experiments across various vision, language, and multimodal tasks demonstrate that QwT is both highly effective and versatile. In fact, our approach offers a robust solution for network quantization that combines simplicity, accuracy, and adaptability, which provides new insights for the design of novel quantization paradigms. The code is publicly available at https://github.com/wujx2001/QwT

Minghao Fu, Hao Yu, Jie Shao, Junjie Zhou, Ke Zhu, Jianxin Wu• 2024

Related benchmarks

Task	Dataset	Result
Language Modeling	WikiText2	Perplexity6.63	3785
Language Modeling	C4	Perplexity9.38	1565
Image Classification	ImageNet-1k (val)	Top-1 Accuracy83.8	1498
Image Classification	ImageNet (val)	Top-1 Acc84	1206
Image Generation	ImageNet 256x256 (val)	FID5.35	399
Instance Segmentation	COCO	APmask45	291
Object Detection	COCO	AP (Box)51.8	186
Zero-shot Image Classification	ImageNet zero-shot	Top-1 Accuracy63	35
Commonsense Question Answering	Commonsense QA 8 datasets	Average QA Score65.18	3

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord