Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Neural B-frame Video Compression with Bi-directional Reference Harmonization

About

Neural video compression (NVC) has made significant progress in recent years, while neural B-frame video compression (NBVC) remains underexplored compared to P-frame compression. NBVC can adopt bi-directional reference frames for better compression performance. However, NBVC's hierarchical coding may complicate continuous temporal prediction, especially at some hierarchical levels with a large frame span, which could cause the contribution of the two reference frames to be unbalanced. To optimize reference information utilization, we propose a novel NBVC method, termed Bi-directional Reference Harmonization Video Compression (BRHVC), with the proposed Bi-directional Motion Converge (BMC) and Bi-directional Contextual Fusion (BCF). BMC converges multiple optical flows in motion compression, leading to more accurate motion compensation on a larger scale. Then BCF explicitly models the weights of reference contexts under the guidance of motion compensation accuracy. With more efficient motions and contexts, BRHVC can effectively harmonize bi-directional references. Experimental results indicate that our BRHVC outperforms previous state-of-the-art NVC methods, even surpassing the traditional coding, VTM-RA (under random access configuration), on the HEVC datasets. The source code is released at https://github.com/kwai/NVC.

Yuxi Liu, Dengchao Jin, Shuai Huo, Jiawen Gu, Chao Zhou, Huihui Bai, Ming Lu, Zhan Ma• 2025

Related benchmarks

TaskDatasetResultRank
Video CompressionMCL-JCV--
79
Video CompressionHEVC Class D
BD-Rate-18.8
23
Video CompressionHEVC Class E
BD-Rate-7.67
23
Video CompressionHEVC Class B
BD-Rate14.2
23
Video CompressionHEVC Class C
BD-Rate7.43
23
Video CompressionUVG
BD-Rate16.83
23
Showing 6 of 6 rows

Other info

Follow for update