Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hallucination Evaluation on POPE v1.0 (test)
Loading...
87.15
F1 Score
ShareGPT4V 7B + VIG training
85.6316
86.0258
86.42
86.8142
Feb 19, 2026
F1 Score
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
F1 Score
Accuracy
ShareGPT4V 7B + VIG training
VIG training=true
2026.02
87.15
87.24
LLaVA-1.5 13B + VIG training
VIG training=true
2026.02
86.95
87.53
LLaVA-1.5 7B + VIG training
VIG training=true
2026.02
85.93
87.47
LLaVA-1.5 7B
VIG training=false
2026.02
85.9
87.08
LLaVA-1.5 13B
VIG training=false
2026.02
85.72
87.05
ShareGPT4V 7B
VIG training=false
2026.02
85.69
86.98
Feedback
Search any
task
Search any
task