Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion

About

Existing multimodal large model-based image compression frameworks often rely on a fragmented integration of semantic retrieval, latent compression, and generative models, resulting in suboptimal performance in both reconstruction fidelity and coding efficiency. To address these challenges, we propose a residual-guided ultra lowrate image compression named ResULIC, which incorporates residual signals into both semantic retrieval and the diffusion-based generation process. Specifically, we introduce Semantic Residual Coding (SRC) to capture the semantic disparity between the original image and its compressed latent representation. A perceptual fidelity optimizer is further applied for superior reconstruction quality. Additionally, we present the Compression-aware Diffusion Model (CDM), which establishes an optimal alignment between bitrates and diffusion time steps, improving compression-reconstruction synergy. Extensive experiments demonstrate the effectiveness of ResULIC, achieving superior objective and subjective performance compared to state-of-the-art diffusion-based methods with - 80.7%, -66.3% BD-rate saving in terms of LPIPS and FID. Project page is available at https: //njuvision.github.io/ResULIC/.

Anle Ke, Xu Zhang, Tong Chen, Ming Lu, Chao Zhou, Jiawen Gu, Zhan Ma• 2025

Related benchmarks

TaskDatasetResultRank
Image CompressionKodak (test)--
32
Image CompressionKodak
BD-Rate (DISTS)-65.31
17
Image CompressionCLIC 2020
BD-rate (LPIPS)-66.5
13
Image CompressionDIV2K
BD-Rate (LPIPS)-62.93
11
Image CompressionDIV2K (test)
BD-DISTS37.75
9
Image CompressionCLIC 2020 (test)
BD-DISTS53.15
9
Showing 6 of 6 rows

Other info

Follow for update