Object Hallucination Probing on COCO POPE (Adversarial)
[Chart: top result is CLAIM, Score (Zh) 83.27, Jun 3, 2025. Metrics: Score (Zh), Score (En), Score (Es), Score (Ru), Score (Pt), Score (Bg), Score (Hi), Score (De), Avg Score.]
Updated 3d ago
Evaluation Results
| Method | Backbone | Date | Score (Zh) | Score (En) | Score (Es) | Score (Ru) | Score (Pt) | Score (Bg) | Score (Hi) | Score (De) | Avg Score |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CLAIM | LLaVA-1.5 | 2025.06 | 83.27 | 77.67 | - | 79.43 | 79.67 | 74.57 | - | - | 78.92 |
| Baseline | Qwen-VL-Chat | 2025.06 | 80.87 | 83.9 | 67.53 | 72.8 | - | - | 63.1 | 75.9 | 72.04 |
| CLAIM | Qwen-VL-Chat | 2025.06 | 80.7 | 82.33 | 77.53 | 69.7 | - | - | 81.5 | - | 78.35 |
| VCD | Qwen-VL-Chat | 2025.06 | 79.4 | - | 71.97 | 70.83 | - | - | 54.73 | 75.03 | 70.39 |
| VCD | LLaVA-1.5 | 2025.06 | 74.27 | - | 67.3 | 66.47 | 73.63 | 66.33 | - | - | 69.6 |
| Baseline | LLaVA-1.5 | 2025.06 | 73.4 | 85.2 | 62.87 | 66.07 | 73.7 | 65.93 | - | - | 68.39 |