Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models

About

We introduce a Depicted image Quality Assessment method (DepictQA), overcoming the constraints of traditional score-based methods. DepictQA allows for detailed, language-based, human-like evaluation of image quality by leveraging Multi-modal Large Language Models (MLLMs). Unlike conventional Image Quality Assessment (IQA) methods relying on scores, DepictQA interprets image content and distortions descriptively and comparatively, aligning closely with humans' reasoning process. To build the DepictQA model, we establish a hierarchical task framework, and collect a multi-modal IQA training dataset. To tackle the challenges of limited training data and multi-image processing, we propose to use multi-source training data and specialized image tags. These designs result in a better performance of DepictQA than score-based approaches on multiple benchmarks. Moreover, compared with general MLLMs, DepictQA can generate more accurate reasoning descriptive languages. We also demonstrate that our full-reference dataset can be extended to non-reference applications. These results showcase the research potential of multi-modal IQA methods. Codes and datasets are available in https://depictqa.github.io.

Zhiyuan You, Zheyuan Li, Jinjin Gu, Zhenfei Yin, Tianfan Xue, Chao Dong• 2023

Related benchmarks

Task	Dataset	Result
Distortion Identification	PANDABENCH Easy	Accuracy75	14
Distortion type classification	PANDABENCH (Hard set)	Accuracy22	14
Distortion Severity Prediction	PANDABENCH Easy	Accuracy55	13
Severity level classification	PANDABENCH (Hard set)	Accuracy30	13
Multi-Image Quality Comparison	SRbench	Registration Accuracy73	11
Comparative Relationship Prediction	PANDABENCH Easy	Accuracy49	9
Quality Score Assessment	PANDABENCH Easy	SRCC78	9
Quality Scoring	PANDABENCH (Hard set)	SRCC0.18	9
Region-wise comparison assessment	PANDABENCH (Hard set)	Accuracy33	9
Image Quality Description	KonIQ	Accuracy5.4	8

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord