Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WorldSense

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-visual understandingWorldSense
Accuracy66.4
32
Multimodal Fact-Level AttributionWorldSense 1.0 (sampled examples)
Accuracy71.4
24
Long Audio-Video Question AnsweringWorldSense
Average Accuracy61.2
18
Audio-visual Question AnsweringWorldSense
Accuracy50
18
Audio-Visual PerceptionWorldSense
Score47.4
8
Video UnderstandingWorldSense
Score52.01
8
Common Sense ReasoningWorldSense
Accuracy0.637
6
Visual Question AnsweringWorldSense sampled examples 1.0
Accuracy60
4
Showing 8 of 8 rows