
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers

About

The complicated architecture and high training cost of vision transformers urge the exploration of post-training quantization. However, the heavy-tailed distribution of vision transformer activations hinders the effectiveness of previous post-training quantization methods, even with advanced quantizer designs. Instead of tuning the quantizer to better fit the complicated activation distribution, this paper proposes NoisyQuant, a quantizer-agnostic enhancement for the post-training activation quantization performance of vision transformers. We make a surprising theoretical discovery that for a given quantizer, adding a fixed Uniform noisy bias to the values being quantized can significantly reduce the quantization error under provable conditions. Building on this theoretical insight, NoisyQuant achieves the first success in actively altering the heavy-tailed activation distribution with an additive noisy bias to fit a given quantizer. Extensive experiments show that NoisyQuant largely improves the post-training quantization performance of vision transformers with minimal computation overhead. For instance, on linear uniform 6-bit activation quantization, NoisyQuant improves SOTA top-1 accuracy on ImageNet by up to 1.7%, 1.1% and 0.5% for ViT, DeiT, and Swin Transformer respectively, achieving on-par or even higher performance than previous nonlinear, mixed-precision quantization.
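The mechanism described above, quantizing x + N for a fixed Uniform noisy bias N and removing the bias after dequantization, can be sketched with a toy linear uniform quantizer. This is an illustration of the mechanism only, not the paper's implementation: the t-distributed stand-in for heavy-tailed activations, the bit-width, and the bias range are assumptions, and the error reduction the paper proves holds under specific conditions that this toy does not attempt to reproduce.

```python
import numpy as np

rng = np.random.default_rng(0)
N_BITS = 6  # assumed bit-width, matching the paper's 6-bit example

def uniform_quant(x, x_min, x_max, n_bits=N_BITS):
    """Linear uniform quantizer: round to the nearest of 2**n_bits levels."""
    step = (x_max - x_min) / (2 ** n_bits - 1)
    q = np.clip(np.round((x - x_min) / step), 0, 2 ** n_bits - 1)
    return q * step + x_min

# Toy heavy-tailed data as a stand-in for ViT activations (assumption).
x = rng.standard_t(df=3, size=100_000)
lo, hi = x.min(), x.max()
scale = (hi - lo) / (2 ** N_BITS - 1)  # quantization step size

# Plain post-training quantization error (mean squared error).
err_plain = np.mean((uniform_quant(x, lo, hi) - x) ** 2)

# NoisyQuant-style: sample a fixed Uniform noisy bias once, add it before
# quantization, and subtract the same bias again after dequantization.
noise = rng.uniform(-scale / 2, scale / 2, size=x.shape)
err_noisy = np.mean((uniform_quant(x + noise, lo, hi) - noise - x) ** 2)

print(f"MSE without noisy bias: {err_plain:.3e}")
print(f"MSE with noisy bias:    {err_noisy:.3e}")
```

With a bias range of one quantization step, the add-then-subtract scheme is closely related to classical subtractive dithering, which makes the residual quantization error uniform and input-independent; NoisyQuant's contribution is identifying the conditions under which such a bias provably lowers the error for heavy-tailed activations.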

Yijiang Liu, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang • 2022

Related benchmarks

Task | Dataset | Result | Rank
Image Super-resolution | Manga109 | PSNR 37.47 | 656
Image Super-resolution | Set5 | PSNR 37.5 | 507
Single Image Super-Resolution | Urban100 | PSNR 31.31 | 500
Image Super-resolution | Set14 | PSNR 33.06 | 329
Image Classification | ImageNet (val) | -- | 300
Object Detection | MS-COCO 2017 (val) | mAP 41.4 | 237
Image Super-resolution | Urban100 | PSNR 26.66 | 221
Image Super-resolution | B100 | PSNR 31.73 | 51
Image Classification | ImageNet-1k (val) | Top-1 Acc (DeiT-S) 79.51 | 20
