Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

About

Reinforcement Learning (RL) has empowered Multimodal Large Language Models (MLLMs) to achieve superior human preference alignment in Image Quality Assessment (IQA). However, existing RL-based IQA models typically rely on coarse-grained global views, failing to capture subtle local degradations in high-resolution scenarios. While emerging "Thinking with Images" paradigms enable multi-scale visual perception via zoom-in mechanisms, their direct adaptation to IQA induces spurious "cropping-implies-degradation" biases and misinterprets natural depth-of-field as artifacts. To address these challenges, we propose Q-Probe, the first agentic IQA framework designed to scale IQA to high resolution via context-aware probing. First, we construct Vista-Bench, a pioneering benchmark tailored for fine-grained local degradation analysis in high-resolution IQA settings. Furthermore, we propose a three-stage training paradigm that progressively aligns the model with human preferences, while simultaneously eliminating causal bias through a novel context-aware cropping strategy. Extensive experiments demonstrate that Q-Probe achieves state-of-the-art performance in high-resolution settings while maintaining superior efficacy across resolution scales.

Xiang Li, XueHeng Li, Yu Wang, XuanHua He, ZhangChi Hu, WeiWei Yu, ChengJun Xie• 2026

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentSPAQ
SRCC0.892
191
Image Quality AssessmentKADID
SRCC0.901
95
Image Quality AssessmentPIPAL
SRCC0.474
95
Image Quality AssessmentKonIQ
SRCC0.871
82
Image Quality AssessmentVista
SRCC0.728
13
Image Quality AssessmentAGIQA
SRCC0.837
13
Image Quality AssessmentTID13
SRCC0.829
13
Showing 7 of 7 rows

Other info

Follow for update