Q-DeepSight: Incentivizing Thinking with Images for Image Quality Assessment and Refinement

About

Image Quality Assessment (IQA) models are increasingly deployed as perceptual critics to guide generative models and image restoration. This role demands not only accurate scores but also actionable, localized feedback. However, current MLLM-based methods adopt a single-look, language-only paradigm, which departs from human evidence-seeking judgment and yields weakly grounded rationales, limiting their reliability for in-the-loop refinement. We propose Q-DeepSight, a think-with-image framework that emulates this human-like process. It performs interleaved Multimodal Chain-of-Thought (iMCoT) with tool-augmented evidence acquisition (e.g., crop-and-zoom) to explicitly determine where quality degrades and why. To train these long iMCoT trajectories via reinforcement learning, we introduce two techniques: Perceptual Curriculum Reward (PCR) to mitigate reward sparsity and Evidence Gradient Filtering (EGF) to improve credit assignment for visually-grounded reasoning. Q-DeepSight achieves state-of-the-art performance across diverse benchmarks, including natural, restored, and AI-generated content. Furthermore, we demonstrate its practical value with Perceptual-in-Generation (PiG), a training-free framework where Q-DeepSight's diagnoses guide iterative image enhancement, effectively closing the loop between assessment and refinement.
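
The assess-then-refine loop described for PiG can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `Diagnosis`, `refine`, `toy_assess`, and `toy_enhance`, the score scale, and the stopping threshold are all hypothetical. In a real setup the assessor would be Q-DeepSight and the enhancer an image restoration or generation model; here both are toy stand-ins operating on a nested-list grayscale image.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[List[float]]        # toy grayscale image, pixel values in [0, 1]
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), bottom/right exclusive


@dataclass
class Diagnosis:
    """Hypothetical critic output: a score plus localized feedback."""
    score: float  # overall quality, higher is better (assumed scale)
    region: Box   # where quality degrades
    issue: str    # why it degrades (free-form label)


def refine(
    image: Image,
    assess: Callable[[Image], Diagnosis],
    enhance: Callable[[Image, Diagnosis], Image],
    target: float = 0.7,
    max_rounds: int = 3,
) -> Tuple[Image, List[Diagnosis]]:
    """Alternate assessment and enhancement until the score reaches `target`
    or `max_rounds` assessments have been made. No model is trained here."""
    history: List[Diagnosis] = []
    for _ in range(max_rounds):
        diagnosis = assess(image)
        history.append(diagnosis)
        if diagnosis.score >= target:
            break
        image = enhance(image, diagnosis)
    return image, history


def toy_assess(image: Image) -> Diagnosis:
    """Stand-in critic: score is mean brightness, region is the darkest row."""
    row_means = [sum(row) / len(row) for row in image]
    worst = min(range(len(image)), key=lambda r: row_means[r])
    score = sum(row_means) / len(row_means)
    return Diagnosis(score, (worst, 0, worst + 1, len(image[0])), "underexposed")


def toy_enhance(image: Image, diagnosis: Diagnosis) -> Image:
    """Stand-in enhancer: brighten only the diagnosed region."""
    top, left, bottom, right = diagnosis.region
    out = [row[:] for row in image]  # copy so the input is left untouched
    for r in range(top, bottom):
        for c in range(left, right):
            out[r][c] = min(1.0, out[r][c] + 0.5)
    return out


if __name__ == "__main__":
    start = [[0.9, 0.9], [0.1, 0.1]]
    final, history = refine(start, toy_assess, toy_enhance)
    for step, d in enumerate(history):
        print(step, round(d.score, 3), d.region, d.issue)
```

The point of the sketch is the control flow: the enhancer only ever sees the critic's localized diagnosis, so the quality of the "where" and "why" feedback directly limits how useful each refinement round can be.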

Xudong Li, Jiaxi Tan, Ziyin Zhou, Yan Zhong, Zihao Huang, Jingyuan Zheng, Yan Zhang, Xiawu Zheng, Rongrong Ji • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Quality Assessment | SPAQ | SRCC 0.911 | 275 |
| Image Quality Assessment | CSIQ | SRCC 0.847 | 192 |
| Image Quality Assessment | KADID | SRCC 0.772 | 164 |
| Image Quality Assessment | PIPAL | SRCC 0.502 | 159 |
| Image Quality Assessment | KonIQ | SRCC 0.942 | 148 |
| Image Quality Assessment | AGIQA-3K | SRCC 0.768 | 137 |
| Image Quality Assessment | AGIQA | SRCC 0.768 | 43 |
| No-Reference Image Quality Assessment | LiveW | PLCC 91.4 | 33 |
| Multi-Image Quality Comparison | SRbench | Registration Accuracy 88.53 | 11 |
| Image Restoration | DIV4K 50 (test) | NIQE 5.26 | 11 |
Showing 10 of 12 rows
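
Most rows above report SRCC (Spearman rank-order correlation coefficient) or PLCC (Pearson linear correlation coefficient) between predicted quality scores and human opinion scores. As a rough reference for how these two metrics are computed, here is a dependency-free sketch. The function names are illustrative, and the sketch omits the nonlinear score mapping that IQA evaluations often fit before computing PLCC, so it should not be read as the exact protocol behind the numbers in the table.

```python
import math
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """PLCC: linear correlation between two equally long score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """SRCC: Pearson correlation computed on the ranks of the scores."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    predicted = [0.2, 0.5, 0.4, 0.9]  # made-up model scores
    opinion = [1.0, 3.0, 2.5, 4.8]    # made-up human opinion scores
    print("SRCC:", spearman(predicted, opinion))
    print("PLCC:", pearson(predicted, opinion))
```

SRCC only depends on the ordering of the predictions, so it rewards a model that ranks images correctly even if its score scale is off, while PLCC also penalizes deviations from a linear relationship.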
