B-CANF: Adaptive B-frame Coding with Conditional Augmented Normalizing Flows
About
Over the past few years, learning-based video compression has become an active research area. However, most works focus on P-frame coding. Learned B-frame coding is under-explored and more challenging. This work introduces a novel B-frame coding framework, termed B-CANF, that exploits conditional augmented normalizing flows for B-frame coding. B-CANF additionally features two novel elements: frame-type adaptive coding and B*-frames. Our frame-type adaptive coding learns better bit allocation for hierarchical B-frame coding by dynamically adapting the feature distributions according to the B-frame type. Our B*-frames allow greater flexibility in specifying the group-of-pictures (GOP) structure by reusing the B-frame codec to mimic P-frame coding, without the need for an additional, separate P-frame codec. On commonly used datasets, B-CANF achieves the state-of-the-art compression performance as compared to the other learned B-frame codecs and shows comparable BD-rate results to HM-16.23 under the random access configuration in terms of PSNR. When evaluated on different GOP structures, our B*-frames achieve similar performance to the additional use of a separate P-frame codec.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Compression | HEVC Class D | BD-Rate-0.7 | 74 | |
| Video Compression | MCL-JCV | BD-Rate (PSNR)18.9 | 60 | |
| Video Compression | HEVC Class B | BD-Rate (%)21.8 | 58 | |
| Video Compression | HEVC Class E | BD-Rate (%)10.2 | 53 | |
| Video Compression | UVG | BD-Rate (PSNR)13.9 | 49 | |
| Video Compression | UVG (test) | BD-Bitrate (PSNR)21 | 30 | |
| Video Compression | MCL-JCV (test) | BD-Bitrate (PSNR)22.6 | 26 | |
| Video Compression | HEVC Class B (test) | BD-Bitrate (PSNR)13.9 | 25 | |
| Video Compression | HEVC ClassB | BD-Rate (MS-SSIM)-31.9 | 17 | |
| Video Compression | HEVC Class D (test) | BD-Rate (PSNR)5.1 | 16 |