Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSCOCO

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Image RetrievalMSCOCO 5K (test)
R@160.9
312
Image-to-text retrievalMSCOCO
R@181.9
152
Text RetrievalMSCOCO
Recall@1100
142
Text-to-image retrievalMSCOCO
R@164.3
142
Text-to-Image RetrievalMSCOCO 1K (test)
R@16,390
118
Visual Hallucination EvaluationMSCOCO
CHAIR_i18.2
104
Object Hallucination EvaluationMSCOCO 2014 (val)
CHAIRs56.8
81
Object Hallucination EvaluationMSCOCO POPE
Random Accuracy91.63
71
Image-to-text retrievalMSCOCO 5K (test)
R@184.8
68
Sentence RetrievalMSCOCO 5k (test)
R@180.9
67
Object DetectionMSCOCO
ASR94.5
54
Text-to-Image GenerationMSCOCO 30K
FID6.61
54
Text-to-Image RetrievalMSCOCO (val)
R@138.97
51
Image-to-Text RetrievalMSCOCO (val)
R@158.14
51
Text-to-image RetrievalMSCOCO (5K)
R@153.98
51
Object HallucinationMSCOCO 500 images 2014 (val)
Consistency Score (CS)60.6
50
Text-to-Image RetrievalMSCOCO
mAP@5094
47
Object Hallucination DetectionMSCOCO
AUROC89.62
46
Text-to-Image GenerationMSCOCO 2014
FID (30k)9.29
44
Object Hallucination EvaluationMSCOCO
Accuracy93.87
43
Object DetectionMSCOCO (val)
AP61.3
43
Image-to-text RetrievalMSCOCO (5K)
R@177.8
42
Pointing gameMSCOCO 2014 (val)
Mean Accuracy (All)69.9
42
Image RetrievalMSCOCO @5000 (test)
mAP87.27
39
Object Hallucination AssessmentMSCOCO
CHAIR Instance Score30.2
38
Showing 25 of 220 rows
...