| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Hallucination Evaluation | CHAIR | CHAIR_s72.8 | 166 | |
| Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) | Chair V2 (test) | Top-1 Accuracy89.69 | 72 | |
| Object Hallucination in Open-ended Captioning | CHAIR (test) | CHAIR_S62.3 | 50 | |
| Object Hallucination Evaluation | CHAIR | CS Score57 | 49 | |
| Hallucination Evaluation | CHAIR MSCOCO 2014 (val) | CHAIRi26.2 | 39 | |
| Hallucination Mitigation | CHAIR | CHAIR_S75 | 24 | |
| Object Hallucination Evaluation | CHAIR MS COCO based (test) | CHAIRs56.2 | 18 | |
| Image Captioning | CHAIR | CHAIR_S31.3 | 16 | |
| Language Quality Evaluation | CHAIR benchmark (test) | BLEU-119.2 | 16 | |
| Object Hallucination Evaluation | CHAIR (val) | CHAIRs Score58.8 | 15 | |
| Object-level Composed Retrieval | Chair V2 | Acc.@573.5 | 10 | |
| Sketch-to-Photo Generation | Chair V2 | FID90.21 | 8 | |
| Object Velocity Tracking | Chair | MAE vx (m/s)0.0409 | 7 | |
| Hallucination Evaluation | CHAIR v1.0 (test) | CS52.93 | 6 | |
| 3D Object Completion | Chair (test) | Chamfer Distance (CD)20.3 | 6 | |
| Novel View Synthesis | Chair NeRF Blender (test) | PSNR34.92 | 6 | |
| Efficiency Comparison | CHAIR benchmark | Avg Time (s/caption)3.36 | 5 | |
| Text-to-3D Generation | Chair | Metric- | 0 |