
MMMU, SEED, OCRBench, VizWiz, ScienceQA, and TextVQA

Benchmarks

Task Name: Multimodal Understanding
Dataset Name: MMMU, SEED, OCRBench, VizWiz, ScienceQA, and TextVQA Average (test)
SOTA Result: Average Accuracy 78.1
Trend: 84