| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Hallucination Evaluation | CHAIR | CHAIR_s 72.8 | 393 |
| Object Hallucination Evaluation | CHAIR | CHAIRi Score 57 | 154 |
| Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) | Chair V2 (test) | Top-1 Accuracy 89.69 | 72 |
| Hallucination Evaluation | CHAIR MSCOCO | CHAIR_S 59.4 | 64 |
| Object Hallucination Evaluation | CHAIR MSCOCO v1.0 (val) | CHAIRs 54.6 | 51 |
| Object Hallucination in Open-ended Captioning | CHAIR (test) | CHAIR_S 62.3 | 50 |
| Hallucination Evaluation | CHAIR MSCOCO 2014 (val) | CHAIRi 26.2 | 45 |
| Caption Hallucination Evaluation | CHAIR | CS Score 53 | 44 |
| Object Hallucination Evaluation | CHAIR MSCOCO | CS Score 62 | 42 |
| Long-form generation hallucination evaluation | CHAIR | CS Score 58.8 | 36 |
| Image Captioning | CHAIR | CHAIR_S 59 | 32 |
| Hallucination Evaluation | CHAIR MSCOCO 2014 | CHAIRs Score 51.3 | 28 |
| Hallucination Mitigation | CHAIR | CHAIR_S 75 | 24 |
| Object Hallucination Mitigation | CHAIR | CHAIRs Score 69.8 | 22 |
| Image Captioning | CHAIR (test) | Cs Score 52.6 | 22 |
| Visual Hallucination | CHAIR | CHAIR Score 15.3 | 21 |
| Hallucination Evaluation | CHAIR (test) | CS Score 50.9 | 20 |
| Object Hallucination Evaluation | CHAIR MS COCO based (test) | CHAIRs 56.2 | 18 |
| Hallucination Evaluation | CHAIR (val) | CHAIRs 59.5 | 16 |
| Language Quality Evaluation | CHAIR benchmark (test) | BLEU-1 19.2 | 16 |
| Object Hallucination Evaluation | CHAIR (val) | CHAIRs Score 58.8 | 15 |
| Caption Hallucination Assessment | CHAIR Zoom Blur (val) | CHAIRS Score 59.8 | 14 |
| Hallucination Assessment | CHAIR | CS 52.8 | 12 |
| Object-level Composed Retrieval | Chair V2 | Acc.@5 73.5 | 10 |
| Object Hallucination Assessment | CHAIR (test) | CS (%) 58.2 | 9 |