Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

About

Image quality assessment (IQA) algorithms aim to quantify the human perception of image quality. Unfortunately, performance drops when assessing distorted images generated by generative adversarial networks (GANs), which have seemingly realistic textures. In this work, we conjecture that this maladaptation lies in the backbone of IQA models: patch-level prediction methods take independent image patches as input and score them separately, but do not model the spatial relationships among patches. We therefore propose an Attention-based Hybrid Image Quality Assessment Network (AHIQ) to address this challenge and improve performance on the GAN-based IQA task. First, we adopt a two-branch architecture for feature extraction, consisting of a vision transformer (ViT) branch and a convolutional neural network (CNN) branch. The hybrid architecture combines the interaction information among image patches captured by the ViT with local texture details from the CNN. To make the features from the shallow CNN focus more on visually salient regions, a deformable convolution is applied, guided by semantic information from the ViT branch. Finally, we use a patch-wise score prediction module to obtain the final score. Experiments show that our model outperforms state-of-the-art methods on four standard IQA datasets, and AHIQ ranked first on the Full Reference (FR) track of the NTIRE 2022 Perceptual Image Quality Assessment Challenge.
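The abstract does not spell out how the patch-wise score prediction module turns per-patch outputs into a single score. As a minimal sketch, assuming each patch yields a quality score and a non-negative importance weight, one common way to pool them is a weighted average. The function name `pool_patch_scores` and the weighting scheme are illustrative assumptions, not the authors' confirmed implementation.

```python
def pool_patch_scores(scores, weights):
    """Pool per-patch quality scores into one image-level score.

    Assumption (not stated in the abstract): every patch has a score and a
    non-negative weight, and the final score is their weighted average.
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    # Weighted average: patches with larger weights contribute more.
    weighted_sum = sum(s * w for s, w in zip(scores, weights))
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical values for four patches of one image.
    patch_scores = [0.2, 0.8, 0.5, 0.9]
    patch_weights = [1.0, 3.0, 1.0, 2.0]
    print(pool_patch_scores(patch_scores, patch_weights))
```

With equal weights this reduces to a plain mean over patches; unequal weights let salient regions dominate the image-level score.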

Shanshan Lao, Yuan Gong, Shuwei Shi, Sidi Yang, Tianhe Wu, Jiahao Wang, Weihao Xia, Yujiu Yang • 2022

Related benchmarks

Task                                      Dataset                Result         Rank
Image Quality Assessment                  CSIQ                   SRC 0.951      138
Image Quality Assessment                  LIVE                   SRC 0.97       96
Image Quality Assessment                  TID 2013               SRC 0.763      74
Perceptual Quality Assessment             HPE-Bench 1.0 (test)   SRCC 0.8824    66
Image Quality Assessment                  TID 2013 (full)        SROCC 0.962    47
Full Reference Image Quality Assessment   TID 2013               SRCC 0.962     42
Full Reference Image Quality Assessment   CSIQ-IQA               SRCC 0.975     40
Image Quality Assessment                  CSIQ (full)            SROCC 0.975    38
Full Reference Image Quality Assessment   LIVE                   PLCC 0.989     33
Editing Alignment Assessment              HPE-Bench 1.0 (test)   SRCC 0.5328    33
Showing 10 of 26 rows
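
The results above are reported as correlation coefficients between predicted scores and human ratings. SRCC and SROCC conventionally denote the Spearman rank-order correlation coefficient and PLCC the Pearson linear correlation coefficient; that "SRC" refers to the same Spearman metric is an assumption here. The sketch below shows how the two metrics are typically computed, using only the standard library. The function names and sample values are illustrative and are not taken from the benchmark.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC) of two sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank-order correlation (SRCC): Pearson on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Hypothetical predicted scores and mean opinion scores.
    predicted = [0.31, 0.48, 0.52, 0.77, 0.90]
    human = [1.2, 2.9, 2.5, 4.1, 4.8]
    print("SRCC:", spearman(predicted, human))
    print("PLCC:", pearson(predicted, human))
```

SRCC depends only on the ordering of the scores, so any monotonic rescaling of the predictions leaves it unchanged, whereas PLCC is sensitive to how linearly the predictions track the human ratings.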
