Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SugarCrepe

Benchmarks

Task NameDataset NameSOTA ResultTrend
Compositional ReasoningSugarCrepe
Overall Accuracy87.5
50
Compositional EvaluationSugarCrepe swap att (test)
Accuracy82.1
27
Compositional EvaluationSugarCrepe
Add Score94.2
21
Image-Text Compositionality EvaluationSugarCrepe ++ (test)
Replace ITT79.7
21
Language CompositionalitySugarCrepe (test)
Replace: Object (R@1)100
21
Vision-Language CompositionalitySugarCrepe
Accuracy88.06
20
Vision-Language Compositional ReasoningSugarCrepe++
Accuracy66.24
20
Compositional EvaluationSugarCrepe (test)
Replace (Object)95.52
20
Image-Text MatchingSugarCrepe
AURC16.7
17
Text-to-Image Compositional UnderstandingSugarCrepe++ T2I
Accuracy61.05
15
Compositional UnderstandingSugarCrepe
Accuracy89.23
15
Attribute-bindingSugarCrepe++
Replace-I2T79.8
11
Attribute-bindingSugarCrepe
Replace Accuracy89.5
11
Visual Question AnsweringSugarCrepe
Simple Accuracy82.14
9
Compositional Image-Text MatchingSugarCrepe
Replacement Score88.7
9
Compositional ReasoningSUGARCREPE (test)
Accuracy86.3
8
Compositional ReasoningSugarCrepe 1.0 (test)
Replace Acc (Object)100
8
Language CompositionalitySugarCrepe 1.0 (test)
Recall@1 (Replace, Object)88.1
8
Image-to-text retrievalSugarCrepe
R@1 (Add)73.8
8
Compositional ReasoningSugarCrepe++
Replace I2T79.7
7
Vision-Language ReasoningSugarCrepe (test)
Simple Accuracy62.75
7
Image-Caption AlignmentSugarCrepe (test)
Replace Object96.9
7
Hard-negative classificationSugarCrepe
Replace: Object Accuracy91.38
6
Hallucination ReasoningSugarCrepe
Accuracy86.4
5
Vision-Language AlignmentSugarcrepe swap-object
Accuracy63.8
4
Showing 25 of 27 rows