Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MSVD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Video RetrievalMSVD
R@171.9
218
Text-to-Video RetrievalMSVD (test)
R@12,030
204
Video CaptioningMSVD
CIDEr195.6
128
Video CaptioningMSVD (test)
CIDEr189.4
111
Video Question AnsweringMSVD
Accuracy79.5
100
Video-to-Text RetrievalMSVD
R@188.4
93
Video-to-Text RetrievalMSVD (test)
R@183.1
61
Open-ended Video Question AnsweringMSVD-QA
Accuracy79.9
59
Video Question AnsweringMSVD (test)
Accuracy76.4
30
Open Ended Question AnsweringMSVD
Accuracy73.92
22
Video-Text RetrievalMSVD
GFLOPS267.8
18
Text-to-Video RetrievalMSVD (val)
Recall@151.8
15
Video CaptioningMSVD-CTN (test)
ROUGE-L31.46
10
Text-to-Video RetrievalMSVD zero-shot
Recall@149.9
8
Text RetrievalMSVD
R@161.5
8
Text-to-Video RetrievalMSVD 43 (val)
Recall@150
7
Video UnderstandingMSVD
Accuracy70.4
6
Video CaptioningMSVD
METEOR51.2
6
Text-to-video retrievalMSVD 10s (test)
R@139.3
6
Video CaptioningMSVD Cap
CIDEr118.2
4
Video Question AnsweringMSVD Open Ended (OE)
Accuracy48.9
4
Video CaptioningMSVD
SB84.32
4
Video-to-Text RetrievalMSVD 43 (val)
R@168.7
4
Text-to-Video RetrievalMSVD (standard)
Recall@158.4
3
Showing 24 of 24 rows