Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Q-Hawkeye: Reliable Visual Policy Optimization for Image Quality Assessment

About

Image Quality Assessment (IQA) predicts perceptual quality scores consistent with human judgments. Recent RL-based IQA methods built on MLLMs focus on generating visual quality descriptions and scores, ignoring two key reliability limitations: (i) although the model's prediction stability varies significantly across training samples, existing GRPO-based methods apply uniform advantage weighting, thereby amplifying noisy signals from unstable samples in gradient updates; (ii) most works emphasize text-grounded reasoning over images while overlooking the model's visual perception ability of image content. In this paper, we propose Q-Hawkeye, an RL-based reliable visual policy optimization framework that redesigns the learning signal through unified Uncertainty-Aware Dynamic Optimization and Perception-Aware Optimization. Q-Hawkeye estimates predictive uncertainty using the variance of predicted scores across multiple rollouts and leverages this uncertainty to reweight each sample's update strength, stabilizing policy optimization. To strengthen perceptual reliability, we construct paired inputs of degraded images and their original images and introduce an Implicit Perception Loss that constrains the model to ground its quality judgments in genuine visual evidence. Extensive experiments demonstrate that Q-Hawkeye outperforms state-of-the-art methods and generalizes better across multiple datasets. Our dataset and code are available at https://github.com/AMAP-ML/Q-Hawkeye.

Wulin Xie, Rui Dai, Ruidong Ding, Kaikui Liu, Xiangxiang Chu, Xinwen Hou, Jie Wen• 2026

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentSPAQ
SRCC0.903
191
Image Quality AssessmentCSIQ
SRC0.806
138
Image Quality AssessmentAGIQA-3K
SRCC0.752
112
Image Quality AssessmentKADID
SRCC77.5
95
Image Quality AssessmentPIPAL
SRCC55
95
Blind Image Quality AssessmentFLIVE
SRCC0.513
86
Image Quality AssessmentKonIQ
SRCC0.951
82
Image Quality AssessmentLIVE-Wild
PLCC0.909
35
Showing 8 of 8 rows

Other info

Follow for update