Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ME-IQA: Memory-Enhanced Image Quality Assessment via Re-Ranking

About

Reasoning-induced vision-language models (VLMs) advance image quality assessment (IQA) with textual reasoning, yet their scalar scores often lack sensitivity and collapse to a few values, so-called discrete collapse. We introduce ME-IQA, a plug-and-play, test-time memory-enhanced re-ranking framework. It (i) builds a memory bank and retrieves semantically and perceptually aligned neighbors using reasoning summaries, (ii) reframes the VLM as a probabilistic comparator to obtain pairwise preference probabilities and fuse this ordinal evidence with the initial score under Thurstone's Case V model, and (iii) performs gated reflection and consolidates memory to improve future decisions. This yields denser, distortion-sensitive predictions and mitigates discrete collapse. Experiments across multiple IQA benchmarks show consistent gains over strong reasoning-induced VLM baselines, existing non-reasoning IQA methods, and test-time scaling alternatives.

Kanglong Fan, Tianhe Wu, Wen Wen, Jianzhao Liu, Le Yang, Yabin Zhang, Yiting Liao, Junlin Li, Li Zhang• 2026

Related benchmarks

TaskDatasetResultRank
No-Reference Image Quality AssessmentCSIQ
SROCC0.815
121
No-Reference Image Quality AssessmentTID 2013
SRCC0.619
105
No-Reference Image Quality AssessmentSPAQ
SROCC0.924
92
No-Reference Image Quality AssessmentKADID
SRCC0.785
43
No-Reference Image Quality AssessmentAGIQA
PLCC0.851
18
No-Reference Image Quality AssessmentPIPAL
PLCC0.642
18
No-Reference Image Quality AssessmentLiveW
PLCC88.7
18
Showing 7 of 7 rows

Other info

Follow for update