Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSDS: Deep Structural Similarity with Multiscale Representation

About

Deep-feature-based perceptual similarity models have demonstrated strong alignment with human visual perception in Image Quality Assessment (IQA). However, most existing approaches operate at a single spatial scale, implicitly assuming that structural similarity at a fixed resolution is sufficient. The role of spatial scale in deep-feature similarity modeling thus remains insufficiently understood. In this letter, we isolate spatial scale as an independent factor using a minimal multiscale extension of DeepSSIM, referred to as Deep Structural Similarity with Multiscale Representation (MSDS). The proposed framework decouples deep feature representation from cross-scale integration by computing DeepSSIM independently across pyramid levels and fusing the resulting scores with a lightweight set of learnable global weights. Experiments on multiple benchmark datasets demonstrate consistent and statistically significant improvements over the single-scale baseline, while introducing negligible additional complexity. The results empirically confirm spatial scale as a non-negligible factor in deep perceptual similarity, isolated here via a minimal testbed.

Danling Kang, Xue-Hua Chen, Bin Liu, Keke Zhang, Weiling Chen, Tiesong Zhao• 2026

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentTID 2013 (test)
Mean SRCC0.864
141
Image Quality AssessmentCSIQ (test)
SRCC0.955
110
Image Quality AssessmentKADID-10k (test)
SRCC0.921
101
Full Reference Image Quality AssessmentCSIQ-IQA (test)
SROCC0.964
39
Full Reference Image Quality AssessmentTID 2013 (test)
PLCC0.911
30
Full Reference Image Quality AssessmentLIVE (test)
PLCC0.966
29
Full Reference Image Quality AssessmentPIPAL (test)--
19
Image Quality AssessmentLIVE (test)
SRCC0.967
7
Image Quality AssessmentPIPAL (test)
SRCC0.701
6
Showing 9 of 9 rows

Other info

Follow for update