Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Multi-modal Evaluation Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-modal UnderstandingGeneral Multi-modal Evaluation Suite (VQAv2, GQA, VisWiz, ScienceQA-IMG, TextVQA, POPE, MMBench, MM-Vet) standard (test val)
VQAv2 Accuracy77.7
9
Showing 1 of 1 rows