Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AVUT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-Visual UnderstandingAVUT AV-Human
Accuracy0.7834
12
Audio-Visual UnderstandingAVUT
Score85.6
8
Audio-Visual QAAVUT
Accuracy66.57
6
Omni-modal UnderstandingAVUT-Human
Overall Score78.6
3
Showing 4 of 4 rows