Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Object Hallucination Detection on POPE (Popular Average across MS-COCO, A-OKVQA, GQA)
Loading...
85.2
Accuracy
VAF
81.976
82.813
83.65
84.487
Mar 17, 2025
Accuracy
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
F1 Score
VAF
Backbone=LLaVA-v1.5-13B
2025.03
85.2
86.4
VAF
Backbone=Qwen-VL-Chat-7B
2025.03
84.9
85.1
VAF
Backbone=LLaVA-v1.5-7B
2025.03
84.5
84.9
VCD
Backbone=LLaVA-v1.5-13B
2025.03
83.7
85.1
ICD
Backbone=Qwen-VL-Chat-7B
2025.03
83.2
84.5
VCD
Backbone=LLaVA-v1.5-7B
2025.03
83.1
84.1
VCD
Backbone=Qwen-VL-Chat-7B
2025.03
83
84.1
ICD
Backbone=LLaVA-v1.5-13B
2025.03
82.9
84.3
Regular
Backbone=LLaVA-v1.5-13B
2025.03
82.7
84.1
Regular
Backbone=LLaVA-v1.5-7B
2025.03
82.5
83.2
Regular
Backbone=Qwen-VL-Chat-7B
2025.03
82.4
83.1
ICD
Backbone=LLaVA-v1.5-7B
2025.03
82.1
82.9
Feedback
Search any
task
Search any
task