NegBench

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Vision-Language Understanding | NegBench VOC2007 | Accuracy: 95.37 | 11
Vision-Language Understanding | NegBench COCO | Accuracy: 93.19 | 11
Multiple Choice Visual Question Answering | NegBench VOC 2007 (test) | Accuracy: 95.37 | 11
Multiple Choice Visual Question Answering | NegBench COCO (test) | Accuracy: 93.19 | 11
Multiple Choice Question | NegBench COCO subset | Overall Accuracy: 32.55 | 4