
CHAIR

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Hallucination Evaluation | CHAIR | CHAIR_s 72.8 | 393 |
| Object Hallucination Evaluation | CHAIR | CHAIRi Score 57 | 154 |
| Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) | Chair V2 (test) | Top-1 Accuracy 89.69 | 72 |
| Hallucination Evaluation | CHAIR MSCOCO | CHAIR_S 59.4 | 64 |
| Object Hallucination Evaluation | CHAIR MSCOCO v1.0 (val) | CHAIRs 54.6 | 51 |
| Object Hallucination in Open-ended Captioning | CHAIR (test) | CHAIR_S 62.3 | 50 |
| Hallucination Evaluation | CHAIR MSCOCO 2014 (val) | CHAIRi 26.2 | 45 |
| Caption Hallucination Evaluation | CHAIR | CS Score 53 | 44 |
| Object Hallucination Evaluation | CHAIR MSCOCO | CS Score 62 | 42 |
| Long-form generation hallucination evaluation | CHAIR | CS Score 58.8 | 36 |
| Image Captioning | CHAIR | CHAIR_S 59 | 32 |
| Hallucination Evaluation | CHAIR MSCOCO 2014 | CHAIRs Score 51.3 | 28 |
| Hallucination Mitigation | CHAIR | CHAIR_S 75 | 24 |
| Object Hallucination Mitigation | CHAIR | CHAIRs Score 69.8 | 22 |
| Image Captioning | CHAIR (test) | Cs Score 52.6 | 22 |
| Visual Hallucination | CHAIR | CHAIR Score 15.3 | 21 |
| Hallucination Evaluation | CHAIR (test) | CS Score 50.9 | 20 |
| Object Hallucination Evaluation | CHAIR MS COCO based (test) | CHAIRs 56.2 | 18 |
| Hallucination Evaluation | CHAIR (val) | CHAIRs 59.5 | 16 |
| Language Quality Evaluation | CHAIR benchmark (test) | BLEU-1 19.2 | 16 |
| Object Hallucination Evaluation | CHAIR (val) | CHAIRs Score 58.8 | 15 |
| Caption Hallucination Assessment | CHAIR Zoom Blur (val) | CHAIRS Score 59.8 | 14 |
| Hallucination Assessment | CHAIR | CS 52.8 | 12 |
| Object-level Composed Retrieval | Chair V2 | Acc.@5 73.5 | 10 |
| Object Hallucination Assessment | CHAIR (test) | CS (%) 58.2 | 9 |
Showing 25 of 37 rows