| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| element-level text-to-image alignment evaluation | MHaluBench | SRCC72.7 | 17 | |
| Hallucination Detection | MHaluBench Image-to-Text Segment-level | Hallucinatory Precision90.44 | 7 | |
| Hallucination Detection | MHaluBench Image-to-Text (Claim-level) | Hallucinatory Precision86.54 | 7 | |
| Visual Reasoning | MHaluBench | SRCC70.6 | 5 |