
Object-HalBench

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Hallucination Evaluation | Object HalBench | CHAIR Score (s): 54.7 | 78 |
| Generative Hallucination | Object HalBench | CHAIR_S Score: 61 | 43 |
| Hallucination assessment | Object-HalBench | Mention Hallucination Rate: 2.6 | 39 |
| Object Hallucination Evaluation | Object HalBench (test) | CHAIRS Score: 43.7 | 24 |
| Hallucination Evaluation | Object HalBench full benchmark | Ha (Living Room): 25.2 | 5 |