| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio-Visual Understanding | AVHBench | Overall Score81.7 | 8 | |
| Cross-modal hallucination evaluation | AVHBench | Video-Driven Audio Hallucination Acc79.7 | 8 | |
| Audiovisual Understanding & Reasoning | AVHBench AVC | Score22.6 | 4 | |
| Audiovisual Understanding & Reasoning | AVHBench AVM | Score61.6 | 4 |