Object Hallucination Evaluation on POPE (average across random and popular)

91.56 Accuracy (POPE)

R-CoV

[Chart: Accuracy (POPE) over time, Apr 22, 2026 to Apr 28, 2026; y-axis ticks at 51, 61.53, 72.06, 82.59]
Updated 1mo ago

Evaluation Results

Method  Links
2026.04  91.56  -      91.25
2026.04  91     -      90.81
2026.04  89.22  -      88.8
2026.04  88     -      86.83
2026.04  87.33  -      87.67
2026.04  87     -      87
2026.04  87     -      87.92
2026.04  87     -      85.39
2026.04  86.22  -      84.19
2026.04  85.89  -      84.79
2026.04  85.88  -      84.51
2026.04  85.69  -      84.62
2026.04  85.57  -      84.82
2026.04  85.45  -      84.47
2026.04  85.14  -      85.01
2026.04  85.06  -      83.72
2026.04  85     -      84.84
2026.04  84.91  -      84.15
2026.04  84.81  -      84.65
2026.04  83.89  -      83.06
2026.04  83.69  -      82.92
2026.04  83.54  -      82.12
2026.04  83.27  -      82.19
2026.04  82.67  -      80.67
2026.04  82.21  -      82.85
2026.04  81.73  -      82.95
2026.04  81.53  -      82.8
2026.04  80.67  -      81.29
2026.04  80.41  -      81.64
2026.04  80.08  -      81.98
2026.04  79.88  -      81.23
2026.04  79.11  -      78.81
2026.04  78.67  -      76.93
2026.04  78.44  -      80.11
2026.04  75.22  -      74.04
2026.04  69.78  -      75.73
2026.04  67.44  -      73.75
2026.04  52.56  -      67.68
2026.04  -      87.7   -
2026.04  -      88.5   -
2026.04  -      88.3   -
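
The benchmark title describes the score as an average across the random and popular POPE settings. A minimal sketch of how such an average might be computed is shown below. It assumes each setting is scored as plain yes/no accuracy and that the two settings are combined with an unweighted mean; the page does not confirm either detail. The function names and the toy answers are hypothetical and are not taken from the leaderboard.

```python
def accuracy(predictions, labels):
    """Percentage of yes/no answers that match the ground truth."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("cannot score an empty split")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


def average_accuracy(per_split):
    """Unweighted mean of per-split accuracies (assumed aggregation)."""
    scores = [accuracy(preds, labels) for preds, labels in per_split.values()]
    return sum(scores) / len(scores)


# Hypothetical toy answers: (model predictions, ground-truth labels).
splits = {
    "random": (["yes", "no", "yes", "no"], ["yes", "no", "yes", "yes"]),
    "popular": (["yes", "no", "no", "no"], ["yes", "yes", "yes", "no"]),
}

result = average_accuracy(splits)
print(round(result, 2))
```

An unweighted mean treats both settings equally regardless of how many questions each contains; a question-weighted mean would be the alternative if the settings differ in size.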