Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CMM

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio Hallucination DetectionCMM
Audio-Language PA95
13
Unimodal-prior-induced hallucination evaluationCMM
Visual Dom. Accuracy82.3
8
Vision-Audio-Language (VAL)CMM
PA Accuracy (Yes Instances)84.5
5
Multimodal Hallucination DetectionCMM Hallucination
Accuracy87
2
Showing 4 of 4 rows