Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Vectra: A New Metric, Dataset, and Model for Visual Quality Assessment in E-Commerce In-Image Machine Translation

About

In-Image Machine Translation (IIMT) powers cross-border e-commerce product listings; existing research focuses on machine translation evaluation, while visual rendering quality is critical for user engagement. When facing context-dense product imagery and multimodal defects, current reference-based methods (e.g., SSIM, FID) lack explainability, while model-as-judge approaches lack domain-grounded, fine-grained reward signals. To bridge this gap, we introduce Vectra, to the best of our knowledge, the first reference-free, MLLM-driven visual quality assessment framework for e-commerce IIMT. Vectra comprises three components: (1) Vectra Score, a multidimensional quality metric system that decomposes visual quality into 14 interpretable dimensions, with spatially-aware Defect Area Ratio (DAR) quantification to reduce annotation ambiguity; (2) Vectra Dataset, constructed from 1.1M real-world product images via diversity-aware sampling, comprising a 2K benchmark for system evaluation, 30K reasoning-based annotations for instruction tuning, and 3.5K expert-labeled preferences for alignment and evaluation; and (3) Vectra Model, a 4B-parameter MLLM that generates both quantitative scores and diagnostic reasoning. Experiments demonstrate that Vectra achieves state-of-the-art correlation with human rankings, and our model outperforms leading MLLMs, including GPT-5 and Gemini-3, in scoring performance. The dataset and model will be released upon acceptance.

Qingyu Wu, Yuxuan Han, Haijun Li, Zhao Xu, Jianshan Zhao, Xu Jin, Longyue Wang, Weihua Luo• 2026

Related benchmarks

TaskDatasetResultRank
Visual Quality AssessmentVectra 1.0 (test)
Pearson R (Text Size)0.4948
14
In-Image Machine Translation EvaluationVectra-Bench zh→en 1.0
Pearson Correlation (r)0.8
8
In-Image Machine Translation EvaluationVectra-Bench zh→es 1.0
Pearson R0.732
8
In-Image Machine Translation EvaluationVectra-Bench zh→pt 1.0
Pearson Correlation (r)0.773
8
In-Image Machine Translation EvaluationVectra-Bench zh→ja 1.0
Pearson Correlation (r)0.677
8
In-Image Machine Translation EvaluationVectra-Bench zh→fr 1.0
Pearson Correlation0.708
8
In-Image Machine Translation EvaluationVectra-Bench Overall 1.0 (in-domain)
Pearson Correlation (r)0.738
8
In-Image Machine Translation EvaluationMCiT Document 1.0
Pearson Correlation (r)0.599
8
In-Image Machine Translation EvaluationMCiT 1.0 (Poster)
Pearson Correlation (r)0.631
8
In-Image Machine Translation EvaluationMCiT Overall OOD 1.0
Pearson Correlation (r)0.558
8
Showing 10 of 11 rows

Other info

Follow for update