
AVHBench

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Video-driven Audio Hallucination | AVHBench | Accuracy 83.4 | 27 |
| Cross-modal hallucination evaluation | AVHBench | Overall Accuracy 88.19 | 22 |
| Audiovisual Matching | AVHBench | Accuracy 69.68 | 14 |
| Audio-Visual Understanding | AVHBench | Overall Score 81.7 | 8 |
| Audio-Visual QA | AVHBench | Accuracy 73.78 | 6 |
| Audio-Visual Captioning | AVHBench | METEOR 17.2 | 5 |
| Audiovisual Understanding & Reasoning | AVHBench AVC | Score 22.6 | 4 |
| Audiovisual Understanding & Reasoning | AVHBench AVM | Score 61.6 | 4 |