Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SugarCrepe

Benchmarks

Task NameDataset NameSOTA ResultTrend
Compositional ReasoningSugarCrepe
Overall Accuracy87.5
50
Image-to-text retrievalSugarCrepe
R@1 (Add)98.44
30
Compositional EvaluationSugarCrepe swap att (test)
Accuracy82.1
27
Compositional ReasoningSugarCrepe++
Average Performance75.2
25
Compositional EvaluationSugarCrepe
Add Score94.2
21
Image-Text Compositionality EvaluationSugarCrepe ++ (test)
Replace ITT79.7
21
Language CompositionalitySugarCrepe (test)
Replace: Object (R@1)100
21
Vision-Language CompositionalitySugarCrepe
Accuracy88.06
20
Vision-Language Compositional ReasoningSugarCrepe++
Accuracy66.24
20
Compositional EvaluationSugarCrepe (test)
Replace (Object)95.52
20
Hallucination DetectionSugarCrepe 1.0 (test)
Avg-M Score98.86
18
Image-Text MatchingSugarCrepe
AURC16.7
17
Text-to-Image Compositional UnderstandingSugarCrepe++ T2I
Accuracy61.05
15
Compositional UnderstandingSugarCrepe
Accuracy89.23
15
Attribute-bindingSugarCrepe++
Replace-I2T79.8
11
Attribute-bindingSugarCrepe
Replace Accuracy89.5
11
Image-Text RetrievalSugarCrepe clean
Transfer R@136.84
9
Hard-negative SelectivitySugarCrepe clean
Attr. Neg. Selectivity88.41
9
Visual Question AnsweringSugarCrepe
Simple Accuracy82.14
9
Compositional Image-Text MatchingSugarCrepe
Replacement Score88.7
9
Compositional ReasoningSUGARCREPE (test)
Accuracy86.3
8
Compositional ReasoningSugarCrepe 1.0 (test)
Replace Acc (Object)100
8
Language CompositionalitySugarCrepe 1.0 (test)
Recall@1 (Replace, Object)88.1
8
Vision-Language ReasoningSugarCrepe (test)
Simple Accuracy62.75
7
Image-Caption AlignmentSugarCrepe (test)
Replace Object96.9
7
Showing 25 of 37 rows