NegBench

Benchmarks

Task Name                                 | Dataset Name             | SOTA Result             | Trend
Multiple Choice Visual Question Answering | NegBench VOC 2007 (test) | Accuracy: 95.37         | 11
Multiple Choice Visual Question Answering | NegBench COCO (test)     | Accuracy: 93.19         | 11
Multiple Choice Question                  | NegBench COCO subset     | Overall Accuracy: 32.55 | 4