Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Knowledge Guided Semi-Supervised Learning for Quality Assessment of User Generated Videos

About

Perceptual quality assessment of user generated content (UGC) videos is challenging due to the requirement of large scale human annotated videos for training. In this work, we address this challenge by first designing a self-supervised Spatio-Temporal Visual Quality Representation Learning (ST-VQRL) framework to generate robust quality aware features for videos. Then, we propose a dual-model based Semi Supervised Learning (SSL) method specifically designed for the Video Quality Assessment (SSL-VQA) task, through a novel knowledge transfer of quality predictions between the two models. Our SSL-VQA method uses the ST-VQRL backbone to produce robust performances across various VQA datasets including cross-database settings, despite being learned with limited human annotated videos. Our model improves the state-of-the-art performance when trained only with limited data by around 10%, and by around 15% when unlabelled data is also used in SSL. Source codes and checkpoints are available at https://github.com/Shankhanil006/SSL-VQA.

Shankhanil Mitra, Rajiv Soundararajan• 2023

Related benchmarks

TaskDatasetResultRank
Video Quality AssessmentKoNViD-1k
SROCC0.826
183
Video Quality AssessmentLIVE-VQC
SRCC0.733
111
Video Quality AssessmentYouTube-UGC
SROCC0.75
86
Video Quality AssessmentHuman-AGVQA
SRCC0.632
21
Video Quality AssessmentT2VQA
SRCC0.58
21
Showing 5 of 5 rows

Other info

Follow for update