Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TextCaps

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image CaptioningTextCaps
CIDEr164.3
96
Image CaptioningTextCaps (val)
CIDEr163.7
51
Image CaptioningTextCaps (test)
CIDEr164.3
50
Text-oriented Visual Question AnsweringTextCaps
CIDEr144.9
7
Image ReconstructionTextCaps (test)
FID15.51
6
Visually Grounded Language GenerationTextCaps (test)
Score152
4
Showing 6 of 6 rows