Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TextCaps

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image CaptioningTextCaps
CIDEr164.3
112
Image CaptioningTextCaps (val)
CIDEr163.7
51
Image CaptioningTextCaps (test)
CIDEr164.3
50
Text-oriented Visual Question AnsweringTextCaps
CIDEr144.9
7
Image ReconstructionTextCaps (test)
FID15.51
6
Image-to-Text RetrievalTextCaps
Recall@197.4
4
Text-to-Image RetrievalTextCaps
Recall@189.6
4
Visually Grounded Language GenerationTextCaps (test)
Score152
4
Showing 8 of 8 rows