Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Object Hallucination Probing on OKVQA POPE Adversarial
Loading...
79.97
POPE Score (Zh)
Baseline
63.7772
67.9811
72.185
76.3889
Jun 3, 2025
POPE Score (Zh)
POPE Score (En)
POPE Score (Es)
POPE Score (Ru)
POPE Score (Pt)
POPE Score (Bg)
POPE Score (Hi)
POPE Score (De)
Average POPE Score
Updated 3d ago
Evaluation Results
Method
Method
Links
POPE Score (Zh)
POPE Score (En)
POPE Score (Es)
POPE Score (Ru)
POPE Score (Pt)
POPE Score (Bg)
POPE Score (Hi)
POPE Score (De)
Average POPE Score
Baseline
Backbone=Qwen-VL-Chat
2025.06
79.97
82.4
66.8
72.5
-
-
60.17
72.3
70.35
VCD
Backbone=Qwen-VL-Chat
2025.06
78.77
-
69
69.63
-
-
50.8
71.9
68.02
CLAIM
Backbone=Qwen-VL-Chat
2025.06
77.2
81.1
78.73
67.73
-
-
78.87
-
76.73
CLAIM
Backbone=LLaVA-1.5
2025.06
75.4
77.7
76.17
75.67
71.97
-
-
-
75.38
VCD
Backbone=LLaVA-1.5
2025.06
68.97
-
65.17
63.9
68.67
64.07
-
-
66.16
Baseline
Backbone=LLaVA-1.5
2025.06
64.4
79.57
62.37
63.93
67.97
63.03
-
-
64.34
Feedback
Search any
task
Search any
task