Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

About

No-Reference Image Quality Assessment (NR-IQA) aims to estimate perceptual quality without access to a reference image of pristine quality. Learning an NR-IQA model faces a fundamental bottleneck: its need for a large number of costly human perceptual labels. We propose SHAMISA, a non-contrastive self-supervised framework that learns from unlabeled distorted images by leveraging explicitly structured relational supervision. Unlike prior methods that impose rigid, binary similarity constraints, SHAMISA introduces implicit structural associations, defined as soft, controllable relations that are both distortion-aware and content-sensitive, inferred from synthetic metadata and intrinsic feature structure. A key innovation is our compositional distortion engine, which generates an uncountable family of degradations from continuous parameter spaces, grouped so that only one distortion factor varies at a time. This enables fine-grained control over representational similarity during training: images with shared distortion patterns are pulled together in the embedding space, while severity variations produce structured, predictable shifts. We integrate these insights via dual-source relation graphs that encode both known degradation profiles and emergent structural affinities to guide the learning process throughout training. A convolutional encoder is trained under this supervision and then frozen for inference, with quality prediction performed by a linear regressor on its features. Extensive experiments on synthetic, authentic, and cross-dataset NR-IQA benchmarks demonstrate that SHAMISA achieves strong overall performance with improved cross-dataset generalization and robustness, all without human quality annotations or contrastive losses.

Mahdi Naseri, Zhou Wang• 2026

Related benchmarks

TaskDatasetResultRank
No-Reference Image Quality AssessmentCSIQ
SROCC0.981
121
Blind Image Quality AssessmentFLIVE
SRCC0.61
115
No-Reference Image Quality AssessmentKADID-10K
SROCC0.922
115
No-Reference Image Quality AssessmentTID 2013
SRCC0.904
105
No-Reference Image Quality AssessmentSPAQ
SROCC0.914
92
No-Reference Image Quality AssessmentLIVE
SROCC0.986
83
Full Reference Image Quality AssessmentTID 2013
SRCC0.909
54
Full Reference Image Quality AssessmentCSIQ-IQA
SRCC0.968
52
Full Reference Image Quality AssessmentLIVE
PLCC0.97
45
Full Reference Image Quality AssessmentKADID-10K
SRCC0.899
10
Showing 10 of 10 rows

Other info

GitHub

Follow for update