Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment

About

Human visual perception naturally evaluates image quality across multiple scales, a hierarchical process that existing blind image quality assessment (BIQA) algorithms struggle to replicate effectively. This limitation stems from a fundamental misunderstanding: current multi-scale approaches fail to recognize that quality perception varies dramatically between scales -- what appears degraded when viewed closely may look acceptable from a distance. This inconsistency not only creates misleading ``visual illusions'' during feature fusion but also introduces substantial redundant information that dilutes quality-critical features and leads to imprecise assessments. Our CSFIQA framework advances multi-scale BIQA via two key innovations: (1) a selective focus attention mechanism that mimics human visual attention by filtering out redundant cross-scale information that would otherwise mask subtle quality indicators, and (2) a scale contrastive learning strategy that explicitly learns to distinguish quality variations both across and within scales. By incorporating an adaptive noise sample matching mechanism, CSFIQA effectively identifies perceptual quality discrepancies in the same content viewed at different scales. Experiments demonstrate substantial improvements over state-of-the-art methods across seven datasets, achieving up to 8.8% SRCC improvement on challenging real-world distortions, confirming CSFIQA's superior alignment with human quality perception.

Runze Hu, Zihao Huang, Xudong Li, Bohan Fu, Yan Zhang, Sicheng Zhao• 2024

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentSPAQ
SRCC0.925
191
Image Quality AssessmentCSIQ
SRC0.967
138
Image Quality AssessmentLIVE
SRC0.982
96
Image Quality AssessmentKonIQ
SRCC0.924
82
Image Quality AssessmentTID 2013
SRC0.899
74
Blind Image Quality AssessmentLIVEC
SRCC0.905
65
No-Reference Image Quality AssessmentLIVEFB
PLCC0.701
42
Image Quality AssessmentLIVEC
SRCC0.838
12
Image Quality AssessmentKonIQ (train)
Epoch Duration (s)270
3
Showing 9 of 9 rows

Other info

Follow for update