Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ViTT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Segment-level Video CaptioningViTT Cooking (test)
BLEU-141.61
9
Segment-level Video CaptioningViTT-All (test)
BLEU-143.34
9
Event localizationViTT (test)
Recall45.89
4
Event CaptioningViTT (test)
CIDEr51.29
3
Dense Video CaptioningViTT (test)
SODA_c25
2
Video CaptioningViTT
BLEU-137.89
2
Segment-level Video CaptioningViTT Cooking 1.0 (test)
BLEU-1-
0
Segment-level Video CaptioningViTT-All 1.0 (test)
BLEU-1-
0
Showing 8 of 8 rows