
Cross-LVLM

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Truthfulness and Calibration Evaluation | Cross-LVLM Pooled Average (GQA, POPE, etc.) | ECE: 7.1 | 8
Multimodal Understanding | Cross-LVLM (Aggregate of GQA, GMAI-MMBench, POPE, MME-Finance, MMMU_Pro, LLaVA-Wild) (test) | ECE: 13.8 | 8