Uni-LVC: A Unified Method for Intra- and Inter-Mode Learned Video Compression
About
Recent advances in learned video compression (LVC) have led to significant performance gains, with codecs such as DCVC-RT surpassing the H.266/VVC low-delay mode in compression efficiency. However, existing LVCs still exhibit key limitations: they often require separate models for intra and inter coding modes, and their performance degrades when temporal references are unreliable. To address this, we introduce Uni-LVC, a unified LVC method that supports both intra and inter coding with low-delay and random-access in a single model. Building on a strong intra-codec, Uni-LVC formulates inter-coding as intra-coding conditioned on temporal information extracted from reference frames. We design an efficient cross-attention adaptation module that integrates temporal cues, enabling seamless support for both unidirectional (low-delay) and bidirectional (random-access) prediction modes. A reliability-aware classifier is proposed to selectively scale the temporal cues, making Uni-LVC behave closer to intra coding when references are unreliable. We further propose a multistage training strategy to facilitate adaptive learning across various coding modes. Extensive experiments demonstrate that Uni-LVC achieves superior rate-distortion performance in intra and inter configurations while maintaining comparable computational efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Compression | MCL-JCV | -- | 79 | |
| Video Compression | HEVC Class B | BD-Rate-20.99 | 23 | |
| Video Compression | UVG | BD-Rate-25.51 | 23 | |
| Video Compression | HEVC Class D | BD-Rate-29.47 | 23 | |
| Video Compression | HEVC Class C | BD-Rate-18.3 | 23 | |
| Video Compression | HEVC Class E | BD-Rate-22.99 | 23 |