Unimodal-prior-induced hallucination evaluation on CMM
[Chart: Visual Dom. Accuracy. Top entry: VideoLLaMA2-AV + MAD, 82.3 (Jan 29, 2026). Metrics: Visual Dom. Accuracy, Audio Dom. Accuracy, Language Dom. Accuracy, Overall Accuracy.]
Updated 4d ago
Evaluation Results
| Method | Details | Date | Visual Dom. Accuracy | Audio Dom. Accuracy | Language Dom. Accuracy | Overall Accuracy |
|---|---|---|---|---|---|---|
| VideoLLaMA2-AV + MAD | Base Model=VideoLLaMA2... | 2026.01 | 82.3 | 84.3 | 77.5 | 81.3 |
| Qwen2.5-Omni-7B + MAD | Base Model=Qwen2.5-Omn... | 2026.01 | 76.8 | 84.3 | 83.3 | 81.4 |
| VideoLLaMA2-AV | Base Model=VideoLLaMA2... | 2026.01 | 71.8 | 80 | 68.8 | 73.5 |
| VideoLLaMA2-AV + AVCD | Base Model=VideoLLaMA2... | 2026.01 | 71.8 | 84 | 71.5 | 75.8 |
| VideoLLaMA2-AV + VCDExtended | Base Model=VideoLLaMA2... | 2026.01 | 71.3 | 83.3 | 74.8 | 76.4 |
| Qwen2.5-Omni-7B + AVCD | Base Model=Qwen2.5-Omn... | 2026.01 | 66.3 | 72.8 | 81 | 73.3 |
| Qwen2.5-Omni-7B | Base Model=Qwen2.5-Omn... | 2026.01 | 64.5 | 72.3 | 81.3 | 72.7 |
| Qwen2.5-Omni-7B + VCDExtended | Base Model=Qwen2.5-Omn... | 2026.01 | 62.5 | 71.3 | 84.5 | 72.8 |