Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank

About

DeepSeek-R1 has demonstrated remarkable effectiveness in incentivizing reasoning and generalization capabilities of large language models (LLMs) through reinforcement learning. Nevertheless, the potential of reasoning-induced computation has not been thoroughly explored in the context of image quality assessment (IQA), a task depending critically on visual reasoning. In this paper, we introduce VisualQuality-R1, a reasoning-induced no-reference IQA (NR-IQA) model, and we train it with reinforcement learning to rank, a learning algorithm tailored to the intrinsically relative nature of visual quality. Specifically, for a pair of images, we employ group relative policy optimization to generate multiple quality scores for each image. These estimates are used to compute comparative probabilities of one image having higher quality than the other under the Thurstone model. Rewards for each quality estimate are defined using continuous fidelity measures rather than discretized binary labels. Extensive experiments show that the proposed VisualQuality-R1 consistently outperforms discriminative deep learning-based NR-IQA models as well as a recent reasoning-induced quality regression method. Moreover, VisualQuality-R1 is capable of generating contextually rich, human-aligned quality descriptions, and supports multi-dataset training without requiring perceptual scale realignment. These features make VisualQuality-R1 especially well-suited for reliably measuring progress in a wide range of image processing tasks like super-resolution and image generation.

Tianhe Wu, Jian Zou, Jie Liang, Lei Zhang, Kede Ma• 2025

Related benchmarks

TaskDatasetResultRank
Image Quality AssessmentSPAQ
SRCC0.913
250
Image Quality AssessmentCSIQ
SRC0.797
150
Image Quality AssessmentAGIQA-3K
SRCC0.797
131
Image Quality AssessmentKADID
SRCC71.9
128
Image Quality AssessmentKonIQ-10k
SRCC0.812
126
Image Quality AssessmentPIPAL
SRCC48.6
123
Image Quality AssessmentKonIQ
SRCC0.908
119
Blind Image Quality AssessmentFLIVE
SRCC0.471
115
Blind Image Quality AssessmentBID
SRCC0.774
63
Image Quality AssessmentCLIVE
SRCC0.826
54
Showing 10 of 44 rows

Other info

Follow for update