Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ConBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal Consistency EvaluationConBench
ECE8.41
14
Vision-Language Reasoning and KnowledgeConBench
Accuracy76.51
11
Showing 2 of 2 rows