| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Hallucination Evaluation | MMHal-Bench | MMHal Score4.7 | 306 | |
| Multimodal Hallucination Evaluation | MMHal-Bench | Average Score4.84 | 129 | |
| Image+Text-to-Text Hallucination Evaluation | MMHal-Bench | BERT Score79 | 18 | |
| Generative Hallucination Mitigation | MMHal-Bench | Overall Score3.49 | 13 | |
| Multi-modal Hallucination Evaluation | MMHal-Bench v1.0 (test) | Overall Score2.14 | 12 | |
| Hallucination Evaluation | MMHal-Bench-V | Hallucination Score2.57 | 9 |