Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Decoupling Perception and Calibration: Label-Efficient Image Quality Assessment Framework

About

Recent multimodal large language models (MLLMs) have demonstrated strong capabilities in image quality assessment (IQA) tasks. However, adapting such large-scale models is computationally expensive and still relies on substantial Mean Opinion Score (MOS) annotations. We argue that for MLLM-based IQA, the core bottleneck lies not in the quality perception capacity of MLLMs, but in MOS scale calibration. Therefore, we propose LEAF, a Label-Efficient Image Quality Assessment Framework that distills perceptual quality priors from an MLLM teacher into a lightweight student regressor, enabling MOS calibration with minimal human supervision. Specifically, the teacher conducts dense supervision through point-wise judgments and pair-wise preferences, with an estimate of decision reliability. Guided by these signals, the student learns the teacher's quality perception patterns through joint distillation and is calibrated on a small MOS subset to align with human annotations. Experiments on both user-generated and AI-generated IQA benchmarks demonstrate that our method significantly reduces the need for human annotations while maintaining strong MOS-aligned correlations, making lightweight IQA practical under limited annotation budgets.

Xinyue Li, Zhichao Zhang, Zhiming Xu, Shubo Xu, Xiongkuo Min, Yitong Chen, Guangtao Zhai• 2026

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentSPAQ
SRCC0.921
191
Image Quality AssessmentKonIQ-10k
SRCC0.899
96
Image Quality AssessmentAGIQA 3K (test)
SRCC0.868
84
Image Quality AssessmentAIGIQA-20K (test)
SRCC0.86
23
Showing 4 of 4 rows

Other info

Follow for update