CANF-VC: Conditional Augmented Normalizing Flows for Video Compression

About

This paper presents an end-to-end learning-based video compression system, termed CANF-VC, based on conditional augmented normalizing flows (CANF). Most learned video compression systems adopt the same hybrid-based coding architecture as the traditional codecs. Recent research on conditional coding has shown the sub-optimality of the hybrid-based coding and opens up opportunities for deep generative models to take a key role in creating new coding frameworks. CANF-VC represents a new attempt that leverages the conditional ANF to learn a video generative model for conditional inter-frame coding. We choose ANF because it is a special type of generative model, which includes variational autoencoder as a special case and is able to achieve better expressiveness. CANF-VC also extends the idea of conditional coding to motion coding, forming a purely conditional coding framework. Extensive experimental results on commonly used datasets confirm the superiority of CANF-VC to the state-of-the-art methods. The source code of CANF-VC is available at https://github.com/NYCU-MAPL/CANF-VC.

Yung-Han Ho, Chih-Peng Chang, Peng-Yu Chen, Alessandro Gnutti, Wen-Hsiao Peng• 2022

Related benchmarks

Task	Dataset	Result
Video Compression	MCL-JCV	BD-Rate (PSNR)60.5	92
Video Compression	HEVC Class D	BD-Rate52.8	74
Video Compression	HEVC Class B	BD-Rate (%)56.4	63
Video Compression	HEVC Class C	BD-Rate (%)70.5	61
Video Compression	HEVC Class E	BD-Rate (%)118	60
Video Compression	UVG	--	55
Video Compression	Standard Video Compression Suite UVG, MCL-JCV, HEVC B/C/D/E/RGB	UVG Score31.2	21
Video Compression	HEVC RGB	BD-Rate (PSNR)79.9	19
Video Compression	HEVC Class D RGB	BD-Rate (MS-SSIM)17.9	16
Video Compression	HEVC Class C (RGB)	BD-Rate (MS-SSIM)30.9	16

Showing 10 of 16 rows

Other info

Follow for update

@wizwand_team Discord