Object Hallucination Detection on POPE MS-COCO Overall
Top result: BRACS, 86.83 Accuracy
[Chart: Accuracy and F1 Score; date shown: May 28, 2026]
Evaluation Results
| Method | Configuration | Date | Accuracy | F1 Score |
|---|---|---|---|---|
| BRACS | Backbone=Qwen-VL-Chat,... | 2026.05 | 86.83 | 85.81 |
| VDD-None | Backbone=Qwen-VL-Chat,... | 2026.05 | 86.78 | 85.78 |
| BRACS | Backbone=LLaVA-1.5-7B,... | 2026.05 | 86.63 | 86.37 |
| PAI | Backbone=Qwen-VL-Chat,... | 2026.05 | 86.26 | 85.33 |
| SPIN | Backbone=LLaVA-1.5-7B,... | 2026.05 | 85.5 | 84.15 |
| VCD | Backbone=Qwen-VL-Chat,... | 2026.05 | 85.4 | 83.95 |
| SPIN | Backbone=Qwen-VL-Chat,... | 2026.05 | 85.19 | 83.53 |
| Greedy | Backbone=LLaVA-1.5-7B,... | 2026.05 | 85.13 | 83.64 |
| Greedy | Backbone=Qwen-VL-Chat,... | 2026.05 | 85.1 | 83.42 |
| VDD-None | Backbone=LLaVA-1.5-7B,... | 2026.05 | 84.51 | 85.55 |
| VCD | Backbone=LLaVA-1.5-7B,... | 2026.05 | 84.21 | 83.19 |
| PAI | Backbone=LLaVA-1.5-7B,... | 2026.05 | 84.02 | 85.3 |
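
The page does not say how Accuracy and F1 Score are computed. As a rough sketch only, assuming the usual setup for a yes/no object-probing benchmark (each question has a ground-truth "yes" or "no", and "yes" is treated as the positive class), the two metrics could be derived as below. The function name `accuracy_and_f1` and the toy answers are hypothetical and are not taken from the leaderboard.

```python
def accuracy_and_f1(labels, predictions, positive="yes"):
    """Return (accuracy, f1) as percentages for binary yes/no answers.

    Assumption: `positive` is the class F1 is measured on ("yes" here).
    """
    if len(labels) != len(predictions):
        raise ValueError("labels and predictions must have the same length")
    if not labels:
        raise ValueError("at least one example is required")

    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == positive and p == positive)
    fp = sum(1 for y, p in pairs if y != positive and p == positive)
    fn = sum(1 for y, p in pairs if y == positive and p != positive)
    correct = sum(1 for y, p in pairs if y == p)

    accuracy = 100.0 * correct / len(pairs)
    # F1 = 2TP / (2TP + FP + FN); defined as 0 when there are no positives at all.
    denom = 2 * tp + fp + fn
    f1 = 100.0 * 2 * tp / denom if denom else 0.0
    return accuracy, f1


# Toy answers for illustration only (not benchmark data).
labels = ["yes", "yes", "no", "no", "yes", "no"]
predictions = ["yes", "no", "no", "yes", "yes", "no"]
acc, f1 = accuracy_and_f1(labels, predictions)
print(f"Accuracy: {acc:.2f}  F1 Score: {f1:.2f}")
```

Because F1 ignores true negatives, a method that answers "yes" more often can score a higher F1 than Accuracy, which is consistent with rows in the table where the F1 Score exceeds the Accuracy.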