Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception

About

Video Quality Assessment (VQA), which intends to predict the perceptual quality of videos, has attracted increasing attention. Due to factors like motion blur or specific distortions, the quality of different regions in a video varies. Recognizing the region-wise local quality within a video is beneficial for assessing global quality and can guide us in adopting fine-grained enhancement or transcoding strategies. Due to the heavy cost of annotating region-wise quality, the lack of ground truth constraints from relevant datasets further complicates the utilization of local perception. Inspired by the Human Visual System (HVS) that links global quality to the local texture of different regions and their visual saliency, we propose a Kaleidoscope Video Quality Assessment (KVQ) framework, which aims to effectively assess both saliency and local texture, thereby facilitating the assessment of global quality. Our framework extracts visual saliency and allocates attention using Fusion-Window Attention (FWA) while incorporating a Local Perception Constraint (LPC) to mitigate the reliance of regional texture perception on neighboring areas. KVQ obtains significant improvements across multiple scenarios on five VQA benchmarks compared to SOTA methods. Furthermore, to assess local perception, we establish a new Local Perception Visual Quality (LPVQ) dataset with region-wise annotations. Experimental results demonstrate the capability of KVQ in perceiving local distortions. KVQ models and the LPVQ dataset will be available at https://github.com/qyp2000/KVQ.

Yunpeng Qu, Kun Yuan, Qizhi Xie, Ming Sun, Chao Zhou, Jian Wang• 2025

Related benchmarks

TaskDatasetResultRank
Video Quality AssessmentKoNViD-1k
SROCC0.909
134
Video Quality AssessmentYouTube-UGC
SROCC0.903
69
Video Quality AssessmentLIVE-VQC
SRCC0.859
64
Video Quality AssessmentLSVQ (test)
SRCC0.896
52
No-Reference Video Quality AssessmentLIVE-VQC
SRCC0.82
50
Video Quality AssessmentLSVQ 1080p
SRCC0.814
46
No-Reference Video Quality AssessmentKoNViD-1k
SRCC0.89
42
Video Quality AssessmentLIVE-VQC, KoNViD-1k, YouTube-UGC (Weighted Average)
SROCC0.868
23
Video Quality AssessmentKoNViD 1k (full)
SRCC0.89
17
Video Quality AssessmentLIVE-VQC original (full)
SRCC0.82
17
Showing 10 of 11 rows

Other info

Code

Follow for update