| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Hallucination Evaluation | HallBench | Accuracy73.6 | 31 | |
| Vision-Language Hallucination Evaluation | HallBench | Accuracy64.2 | 15 | |
| Multimodal Hallucination Evaluation | HallBench | Score65.2 | 9 | |
| Hallucination Evaluation | HallBench avg | Hallucination Score58.1 | 7 |