Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding

About

The rapid advancement of educational applications, artistic creation, and AI-generated content (AIGC) technologies has substantially increased practical requirements for comprehensive Image Aesthetics Assessment (IAA), particularly demanding methods capable of delivering both quantitative scoring and professional understanding. Multimodal Large Language Model (MLLM)-based IAA methods demonstrate stronger perceptual and generalization capabilities compared to traditional approaches, yet they suffer from modality bias (score-only or text-only) and lack fine-grained attribute decomposition, thereby failing to support further aesthetic assessment. In this paper, we present:(1) ArtiMuse, an innovative MLLM-based IAA model with Joint Scoring and Expert-Level Understanding capabilities; (2) ArtiMuse-10K, the first expert-curated image aesthetic dataset comprising 10,000 images spanning 5 main categories and 15 subcategories, each annotated by professional experts with 8-dimensional attributes analysis and a holistic score. Both the model and dataset will be made public to advance the field.

Shuo Cao, Nan Ma, Jiayang Li, Xiaohui Li, Lihao Shao, Kaiwen Zhu, Yu Zhou, Yuandong Pu, Jiarui Wu, Jiaquan Wang, Bo Qu, Wenhai Wang, Yu Qiao, Dajuin Yao, Yihao Liu• 2025

Related benchmarks

TaskDatasetResultRank
Image Aesthetic AssessmentAVA
SRCC0.397
68
Visual Rating (Image Aesthetic Assessment)TAD66K
SRCC0.23
40
Visual Rating (Image Aesthetic Assessment)ArtiMuse-10K
SRCC0.614
34
Visual Rating (Image Aesthetic Assessment)FLICKR-AES
SRCC34.9
33
Visual Question AnsweringUniPercept-Bench-VQA IAA 1.0 (test)
Comprehension Score67.69
18
Semiosis Quality EvaluationHGI SemiosisArt
KRCC0.075
18
Visual Question AnsweringHGI SemiosisArt
Accuracy67.7
10
Showing 7 of 7 rows

Other info

Follow for update