Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation

About

Although recent generative image compression methods have demonstrated impressive potential in optimizing the rate-distortion-perception trade-off, they still face the critical challenge of flexible rate adaption to diverse compression necessities and scenarios. To overcome this challenge, this paper proposes a Controllable Generative Image Compression framework, termed Control-GIC, the first capable of fine-grained bitrate adaption across a broad spectrum while ensuring high-fidelity and generality compression. Control-GIC is grounded in a VQGAN framework that encodes an image as a sequence of variable-length codes (i.e. VQ-indices), which can be losslessly compressed and exhibits a direct positive correlation with the bitrates. Drawing inspiration from the classical coding principle, we correlate the information density of local image patches with their granular representations. Hence, we can flexibly determine a proper allocation of granularity for the patches to achieve dynamic adjustment for VQ-indices, resulting in desirable compression rates. We further develop a probabilistic conditional decoder capable of retrieving historic encoded multi-granularity representations according to transmitted codes, and then reconstruct hierarchical granular features in the formalization of conditional probability, enabling more informative aggregation to improve reconstruction realism. Our experiments show that Control-GIC allows highly flexible and controllable bitrate adaption where the results demonstrate its superior performance over recent state-of-the-art methods. Code is available at https://github.com/lianqi1008/Control-GIC.

Anqi Li, Feng Li, Yuxi Liu, Runmin Cong, Yao Zhao, Huihui Bai• 2024

Related benchmarks

TaskDatasetResultRank
Image CompressionTecnick--
44
Image CompressionKodak (test)--
32
Image CompressionKodak
BD-Rate (DISTS)5.42e+3
17
Image CompressionTecnick (test)
BD-rate (LPIPS)68.83
10
Image CompressionKodak
BD-DISTS34.18
10
Image CompressionCLIC Professional 2020
BD-rate (LPIPS)136.3
9
Image CompressionDIV2K (test)--
9
Image CompressionCLIC
BD-Rate (DISTS)110.8
6
Showing 8 of 8 rows

Other info

Follow for update