Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment

About

Image Quality Assessment (IQA) is a fundamental task in computer vision that has witnessed remarkable progress with deep neural networks. Inspired by the characteristics of the human visual system, existing methods typically use a combination of global and local representations (\ie, multi-scale features) to achieve superior performance. However, most of them adopt simple linear fusion of multi-scale features, and neglect their possibly complex relationship and interaction. In contrast, humans typically first form a global impression to locate important regions and then focus on local details in those regions. We therefore propose a top-down approach that uses high-level semantics to guide the IQA network to focus on semantically important local distortion regions, named as \emph{TOPIQ}. Our approach to IQA involves the design of a heuristic coarse-to-fine network (CFANet) that leverages multi-scale features and progressively propagates multi-level semantic information to low-level representations in a top-down manner. A key component of our approach is the proposed cross-scale attention mechanism, which calculates attention maps for lower level features guided by higher level features. This mechanism emphasizes active semantic regions for low-level distortions, thereby improving performance. CFANet can be used for both Full-Reference (FR) and No-Reference (NR) IQA. We use ResNet50 as its backbone and demonstrate that CFANet achieves better or competitive performance on most public FR and NR benchmarks compared with state-of-the-art methods based on vision transformers, while being much more efficient (with only ${\sim}13\%$ FLOPS of the current best FR method). Codes are released at \url{https://github.com/chaofengc/IQA-PyTorch}.

Chaofeng Chen, Jiadi Mo, Jingwen Hou, Haoning Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin• 2023

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentCSIQ
SRC0.969
138
Image Quality AssessmentTID 2013 (test)
Mean SRCC0.954
124
Image Quality AssessmentCSIQ (test)
SRCC0.978
103
Image Quality AssessmentLIVE
SRC0.974
96
Image Quality AssessmentKADID
SRCC0.895
95
Blind Image Quality AssessmentFLIVE
SRCC0.5796
86
No-Reference Image Quality AssessmentKonIQ-10k
SROCC0.928
73
No-Reference Image Quality AssessmentCSIQ
SROCC0.894
73
Perceptual Quality AssessmentHPE-Bench 1.0 (test)
SRCC0.8026
66
Image Aesthetic AssessmentAVA
SRCC0.791
53
Showing 10 of 50 rows

Other info

Code

Follow for update