Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

QMamba: On First Exploration of Vision Mamba for Image Quality Assessment

About

In this work, we take the first exploration of the recently popular foundation model, i.e., State Space Model/Mamba, in image quality assessment (IQA), aiming at observing and excavating the perception potential in vision Mamba. A series of works on Mamba has shown its significant potential in various fields, e.g., segmentation and classification. However, the perception capability of Mamba remains under-explored. Consequently, we propose QMamba by revisiting and adapting the Mamba model for three crucial IQA tasks, i.e., task-specific, universal, and transferable IQA, which reveals its clear advantages over existing foundational models, e.g., Swin Transformer, ViT, and CNNs, in terms of perception and computational cost. To improve the transferability of QMamba, we propose the StylePrompt tuning paradigm, where lightweight mean and variance prompts are injected to assist task-adaptive transfer learning of pre-trained QMamba for different downstream IQA tasks. Compared with existing prompt tuning strategies, our StylePrompt enables better perceptual transfer with lower computational cost. Extensive experiments on multiple synthetic, authentic IQA datasets, and cross IQA datasets demonstrate the effectiveness of our proposed QMamba. The code will be available at: https://github.com/bingo-G/QMamba.git

Fengbin Guan, Xin Li, Zihao Yu, Yiting Lu, Zhibo Chen• 2024

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentSPAQ
SRCC0.927
275
No-Reference Image Quality AssessmentKADID-10K
SROCC0.923
146
Blind Image Quality AssessmentFLIVE
SRCC0.574
127
Blind Image Quality AssessmentLIVEC
SRCC0.863
79
Image Quality AssessmentTID 2013
PLCC0.952
55
No-Reference Image Quality AssessmentLIVEFB
PLCC0.672
48
Blind Image Quality AssessmentLIVE
SRCC0.959
42
Blind Image Quality AssessmentKonIQ
SRCC0.928
29
Blind Image Quality AssessmentCSIQ
SRCC91.6
12
Showing 9 of 9 rows

Other info

Follow for update