
MSVD

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text-to-Video Retrieval | MSVD | R@1 71.9 | 264 |
| Text-to-Video Retrieval | MSVD (test) | R@1 2,030 | 204 |
| Video Captioning | MSVD | CIDEr 195.6 | 157 |
| Video Question Answering | MSVD | Accuracy 79.5 | 152 |
| Video Captioning | MSVD (test) | CIDEr 189.4 | 111 |
| Video-to-Text Retrieval | MSVD | R@1 88.4 | 93 |
| Video-to-Text Retrieval | MSVD (test) | R@1 83.1 | 61 |
| Open-ended Video Question Answering | MSVD-QA | Accuracy 79.9 | 59 |
| Video Question Answering | MSVD (test) | Accuracy 76.4 | 30 |
| Video-Text Retrieval | MSVD | R@1 62.7 | 29 |
| Open Ended Question Answering | MSVD | Accuracy 73.92 | 22 |
| Video Understanding | MSVD | Accuracy 71.6 | 21 |
| Text-to-Video Retrieval | MSVD (val) | Recall@1 51.8 | 15 |
| Video Captioning | MSVD-CTN (test) | ROUGE-L 31.46 | 10 |
| Emotional video captioning | EVC-MSVD | Accuracy (SW) 91.3 | 9 |
| Text-to-Video Retrieval | MSVD zero-shot | Recall@1 49.9 | 8 |
| Text Retrieval | MSVD | R@1 61.5 | 8 |
| Text-to-Video Retrieval | MSVD 43 (val) | Recall@1 50 | 7 |
| Vehicle Action Understanding | MSVD | BLEU-1 0.88 | 6 |
| Video Captioning | MSVD | METEOR 51.2 | 6 |
| Text-to-video retrieval | MSVD 10s (test) | R@1 39.3 | 6 |
| Video Captioning | MSVD Cap | CIDEr 118.2 | 4 |
| Video Question Answering | MSVD Open Ended (OE) | Accuracy 48.9 | 4 |
| Video Captioning | MSVD | SB 84.32 | 4 |
| Video-to-Text Retrieval | MSVD 43 (val) | R@1 68.7 | 4 |

Showing 25 of 27 rows