Knowledge Guided Semi-Supervised Learning for Quality Assessment of User Generated Videos

About

Perceptual quality assessment of user generated content (UGC) videos is challenging due to the requirement of large scale human annotated videos for training. In this work, we address this challenge by first designing a self-supervised Spatio-Temporal Visual Quality Representation Learning (ST-VQRL) framework to generate robust quality aware features for videos. Then, we propose a dual-model based Semi Supervised Learning (SSL) method specifically designed for the Video Quality Assessment (SSL-VQA) task, through a novel knowledge transfer of quality predictions between the two models. Our SSL-VQA method uses the ST-VQRL backbone to produce robust performances across various VQA datasets including cross-database settings, despite being learned with limited human annotated videos. Our model improves the state-of-the-art performance when trained only with limited data by around 10%, and by around 15% when unlabelled data is also used in SSL. Source codes and checkpoints are available at https://github.com/Shankhanil006/SSL-VQA.

Shankhanil Mitra, Rajiv Soundararajan• 2023

Related benchmarks

Task	Dataset	Result
Video Quality Assessment	KoNViD-1k	SROCC0.826	208
Video Quality Assessment	LIVE-VQC	SRCC0.733	151
Video Quality Assessment	YouTube-UGC	SROCC0.75	110
Video Quality Assessment	Human-AGVQA	SRCC0.632	21
Video Quality Assessment	T2VQA	SRCC0.58	21

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord