Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MS-SCANet: A Multiscale Transformer-Based Architecture with Dual Attention for No-Reference Image Quality Assessment

About

We present the Multi-Scale Spatial Channel Attention Network (MS-SCANet), a transformer-based architecture designed for no-reference image quality assessment (IQA). MS-SCANet features a dual-branch structure that processes images at multiple scales, effectively capturing both fine and coarse details, an improvement over traditional single-scale methods. By integrating tailored spatial and channel attention mechanisms, our model emphasizes essential features while minimizing computational complexity. A key component of MS-SCANet is its cross-branch attention mechanism, which enhances the integration of features across different scales, addressing limitations in previous approaches. We also introduce two new consistency loss functions, Cross-Branch Consistency Loss and Adaptive Pooling Consistency Loss, which maintain spatial integrity during feature scaling, outperforming conventional linear and bilinear techniques. Extensive evaluations on datasets like KonIQ-10k, LIVE, LIVE Challenge, and CSIQ show that MS-SCANet consistently surpasses state-of-the-art methods, offering a robust framework with stronger correlations with subjective human scores.

Mayesha Maliha R. Mithila, Mylene C.Q. Farias• 2026

Related benchmarks

TaskDatasetResultRank
No-Reference Image Quality AssessmentKonIQ-10k
SROCC0.909
73
No-Reference Image Quality AssessmentCSIQ
SROCC0.925
73
No-Reference Image Quality AssessmentLIVE
SROCC0.964
53
No-Reference Image Quality AssessmentLIVE-C
PLCC0.903
23
No-Reference Image Quality AssessmentAverage across all datasets
PLCC0.928
9
Showing 5 of 5 rows

Other info

Follow for update