Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CC3M

Benchmarks

Task NameDataset NameSOTA ResultTrend
Object RecognitionCC3M (test)
Recall0.738
21
Multimodal UnderstandingCC3M IOD
Accuracy100
14
Malicious Prompt DetectionCC3M (IOD)
FPR0
14
Text-to-Image RetrievalCC3M
Recall45.7
9
Image-to-Text RetrievalCC3M
Recall47.2
9
Image ClassificationCC3M
Accuracy46.7
9
Multi-Tag SelectionCC3M (test)
Precision92.5
9
Text-to-image generationCC3M
FID6.06
7
Multi-Tag SelectionCC3M
Precision0.883
6
Vision-Language Compositional EvaluationCC3M 50,000 random subset TripletData
Text Score92.25
4
Text-level Semantic SegmentationCC3M (subset)
Caption IoU65.5
4
Object RecognitionCC3M
Recall86.8
3
Showing 12 of 12 rows