Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSRVTT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video Question AnsweringMSRVTT-QA
Accuracy72.4
505
Video Question AnsweringMSRVTT-QA (test)
Accuracy88.2
376
Text-To-Video retrievalMSRVTT (test)
Recall@546
178
Text-to-Video RetrievalMSRVTT
R@163.9
144
Video CaptioningMSRVTT
CIDEr80.3
107
Video Question AnsweringMSRVTT
Accuracy66.7
100
Text-to-video retrievalMSRVTT
R@161
75
Video CaptioningMSRVTT
CIDEr80.3
68
Text-to-Video RetrievalMSRVTT 1k (test)
Recall@1087.4
63
Video CaptioningMSRVTT (test)
CIDEr80.5
61
Video Question AnsweringMSRVTT-MC
Accuracy97.7
61
Text-to-Video RetrievalMSRVTT
Recall@149.9
59
Video UnderstandingMSRVTT
Acc57.7
43
Video-to-Text RetrievalMSRVTT 1kA severity degree 2
Performance (Gaussian Noise)44.2
42
Text-to-Video RetrievalMSRVTT (1K-A)
R@149.3
42
Video GenerationMSRVTT (val)
FVD414
40
Text-to-Video RetrievalMSRVTT
Recall@151
38
Video-to-Text RetrievalMSRVTT
R@149.2
35
Text-to-Video RetrievalMSRVTT (UTD)
Recall@131.1
34
Text-to-Video RetrievalMSRVTT full (test val)
Recall@143.6
34
Video Question AnsweringMSRVTT-MC (test)
Accuracy97.8
31
Text-to-Video RetrievalMSRVTT (MSR) zero-shot
R@143.3
30
Video-to-Text RetrievalMSRVTT
Recall@140.4
28
Video-to-Text RetrievalMSRVTT v2t 1.0
Performance (Gaussian Noise)24.7
28
Video Question AnsweringMSRVTT (test)
Accuracy92.7
26
Showing 25 of 76 rows