MMVet

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multi-modal Vision-Language Understanding | MMVet | Score 81.3 | 38 |
| Self-evaluation | MMVet | AUROC 0.886 | 36 |
| Multi-modal Understanding | MMVet | Accuracy 76.2 | 35 |
| Multi-modal Reasoning | MMVet (test) | Accuracy 80.8 | 30 |
| Multimodal Understanding | MMVet turbo | Accuracy 74 | 28 |
| Multimodal Understanding | MMVet v2 (0613) | Accuracy 71.8 | 21 |
| Multi-modal Vision-Language Evaluation | MMVet | Accuracy 46.8 | 19 |
| General Visual Question Answering | MMVet 2024b | Score 66.8 | 13 |
| Multimodal Understanding | MMVet | Pass@1 74.94 | 9 |
| Pointwise Scoring | MMVet pointwise | Kendall's Tau 0.974 | 9 |
| Multimodal Comprehension | MMVet | Score 58 | 8 |
| General VQA | MMVet | Score 66.8 | 8 |
| General Visual Question Answering | MMVet turbo | Score 76.2 | 7 |
| Vision Understanding | MMVet v1.0 (test) | Score 36.87 | 6 |
| Universal multi-modal reasoning | MMVet | Pass@1 63.31 | 2 |