Decoupling Perception and Calibration: Label-Efficient Image Quality Assessment Framework

About

Recent multimodal large language models (MLLMs) have demonstrated strong capabilities in image quality assessment (IQA) tasks. However, adapting such large-scale models is computationally expensive and still relies on substantial Mean Opinion Score (MOS) annotations. We argue that for MLLM-based IQA, the core bottleneck lies not in the quality perception capacity of MLLMs, but in MOS scale calibration. Therefore, we propose LEAF, a Label-Efficient Image Quality Assessment Framework that distills perceptual quality priors from an MLLM teacher into a lightweight student regressor, enabling MOS calibration with minimal human supervision. Specifically, the teacher conducts dense supervision through point-wise judgments and pair-wise preferences, with an estimate of decision reliability. Guided by these signals, the student learns the teacher's quality perception patterns through joint distillation and is calibrated on a small MOS subset to align with human annotations. Experiments on both user-generated and AI-generated IQA benchmarks demonstrate that our method significantly reduces the need for human annotations while maintaining strong MOS-aligned correlations, making lightweight IQA practical under limited annotation budgets.

Xinyue Li, Zhichao Zhang, Zhiming Xu, Shubo Xu, Xiongkuo Min, Yitong Chen, Guangtao Zhai• 2026

Related benchmarks

Task	Dataset	Result
Image Quality Assessment	SPAQ	SRCC0.921	311
Image Quality Assessment	KonIQ-10k	SRCC0.899	126
Image Quality Assessment	AGIQA 3K (test)	SRCC0.868	84
Image Quality Assessment	AIGIQA-20K (test)	SRCC0.86	23

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord