LongVALE

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Video-to-audio generation | LongVale | FD (VGG) 3.23 | 8 |
| Omni-modal segment captioning | LongVALE 1.0 (test) | ROUGE-L 0.224 | 8 |
| Omni-modal dense video captioning | LongVALE 1.0 (test) | SODA_c 2.8 | 8 |
| Omni-modal temporal video grounding | LongVALE 1.0 (test) | R@0.3 15.7 | 8 |