Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalist VLM Benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction Following and Multimodal ReasoningGeneralist VLM Benchmarks IFEval, MMBench
IFEval Score61.74
10
Showing 1 of 1 rows