Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization

About

Vector quantization (VQ) is a prevalent and fundamental technique that discretizes continuous feature vectors by approximating them using a codebook. As the diversity and complexity of data and models continue to increase, there is an urgent need for high-capacity, yet more compact VQ methods. This paper aims to reconcile this conflict by presenting a new approach called LooC, which utilizes an effective Low-dimensional codebook for Compositional vector quantization. Firstly, LooC introduces a parameter-efficient codebook by reframing the relationship between codevectors and feature vectors, significantly expanding its solution space. Instead of individually matching codevectors with feature vectors, LooC treats them as lower-dimensional compositional units within feature vectors and combines them, resulting in a more compact codebook with improved performance. Secondly, LooC incorporates a parameter-free extrapolation-by-interpolation mechanism to enhance and smooth features during the VQ process, which allows for better preservation of details and fidelity in feature approximation. The design of LooC leads to full codebook usage, effectively utilizing the compact codebook while avoiding the problem of collapse. Thirdly, LooC can serve as a plug-and-play module for existing methods for different downstream tasks based on VQ. Finally, extensive evaluations on different tasks, datasets, and architectures demonstrate that LooC outperforms existing VQ methods, achieving state-of-the-art performance with a significantly smaller codebook.

Jie Li, Kwan-Yee K. Wong, Kai Han• 2026

Related benchmarks

TaskDatasetResultRank
Image ReconstructionFFHQ (val)
PSNR32.44
66
Image ReconstructionImageNet (val)
rFID1.01
54
Image ReconstructionCIFAR-10
LPIPS0.0285
25
Image ReconstructionMNIST--
24
Image ReconstructionMNIST (val)
L1 Loss0.0062
6
Image ReconstructionCIFAR10 (val)
L1 Loss0.0144
6
Image ReconstructionFashion-MNIST (val)
L1 Loss0.0103
4
Class-conditional image synthesisImageNet (test)
FID46.78
1
Class-conditional image synthesisLSUN churches (test)
FID15.17
1
Class-conditional image synthesisLSUN Bedroom (test)
FID17.52
1
Showing 10 of 10 rows

Other info

Follow for update