Object presence hallucination evaluation on POPE GQA 2019 (Adversarial)
Top result: AVISC, 73.07 Accuracy (May 28, 2024)
Metrics reported: Accuracy, Precision, Recall, F1 Score
Evaluation Results
Method  Model         Date     Accuracy  Precision  Recall  F1 Score
AVISC   InstructBLIP  2024.05  73.07     67.8       87.87   76.54
VCD     InstructBLIP  2024.05  70.27     65.43      85.93   74.29
AVISC   LLaVA-1.5     2024.05  69.2      62.61      95.33   75.58
M3ID    InstructBLIP  2024.05  68.9      64.06      86.13   73.47
base    LLaVA-1.5     2024.05  68.73     62.54      93.4    74.92
VCD     LLaVA-1.5     2024.05  68.27     62         94.4    74.84
M3ID    LLaVA-1.5     2024.05  68.13     61.88      94.47   74.78
base    InstructBLIP  2024.05  68        63.49      84.73   72.59
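
The F1 Score values in these results appear consistent with the standard definition of F1 as the harmonic mean of Precision and Recall. The sketch below recomputes F1 from the reported Precision and Recall as a sanity check. The function name `f1_score` and the `rows` list are illustrative and not part of the benchmark page, and it assumes the page reports percentages rounded to two decimals.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (method, model, precision, recall, reported F1), copied from the results above
rows = [
    ("AVISC", "InstructBLIP", 67.8, 87.87, 76.54),
    ("VCD", "InstructBLIP", 65.43, 85.93, 74.29),
    ("AVISC", "LLaVA-1.5", 62.61, 95.33, 75.58),
    ("M3ID", "InstructBLIP", 64.06, 86.13, 73.47),
    ("base", "LLaVA-1.5", 62.54, 93.4, 74.92),
    ("VCD", "LLaVA-1.5", 62, 94.4, 74.84),
    ("M3ID", "LLaVA-1.5", 61.88, 94.47, 74.78),
    ("base", "InstructBLIP", 63.49, 84.73, 72.59),
]

for method, model, p, r, reported in rows:
    computed = f1_score(p, r)
    print(f"{method:6s} {model:13s} computed={computed:.2f} reported={reported:.2f}")
```

Across all rows Recall is much higher than Precision, so the F1 ranking does not follow the Accuracy ranking exactly: for example, base on LLaVA-1.5 has a higher F1 than VCD on InstructBLIP despite lower Accuracy.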