Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSRVTT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video Question AnsweringMSRVTT-QA
Accuracy72.4
491
Video Question AnsweringMSRVTT-QA (test)
Accuracy88.2
376
Text-To-Video retrievalMSRVTT (test)
Recall@118.2
155
Text-to-Video RetrievalMSRVTT
R@163.9
116
Video CaptioningMSRVTT
CIDEr80.3
107
Video Question AnsweringMSRVTT
Accuracy66.7
100
Text-to-video retrievalMSRVTT
R@161
75
Video CaptioningMSRVTT
CIDEr80.3
68
Text-to-Video RetrievalMSRVTT 1k (test)
Recall@1087.4
63
Video CaptioningMSRVTT (test)
CIDEr80.5
61
Video Question AnsweringMSRVTT-MC
Accuracy97.7
61
Text-to-Video RetrievalMSRVTT
Recall@149.9
59
Text-to-Video RetrievalMSRVTT (1K-A)
R@149.3
42
Video GenerationMSRVTT (val)
FVD414
40
Text-to-Video RetrievalMSRVTT
Recall@151
38
Video-to-Text RetrievalMSRVTT
R@149.2
35
Text-to-Video RetrievalMSRVTT (UTD)
Recall@131.1
34
Text-to-Video RetrievalMSRVTT full (test val)
Recall@143.6
34
Video Question AnsweringMSRVTT-MC (test)
Accuracy97.8
31
Text-to-Video RetrievalMSRVTT (MSR) zero-shot
R@143.3
30
Video Question AnsweringMSRVTT (test)
Accuracy92.7
26
Video-to-Text RetrievalMSRVTT
R@150.1
24
Text-to-Video RetrievalMSRVTT 1K-A (test)
R@154.2
23
Text-to-Video RetrievalMSRVTT 1K 1.0 (test)
R@140.9
23
Video UnderstandingMSRVTT
Acc57.7
21
Showing 25 of 61 rows