
Object Hallucination Evaluation on A-OKVQA POPE (Random)

92.1 Accuracy

SIRA

[Chart: accuracy over time, Jan 3, 2025 to May 14, 2026]

Evaluation Results

Date      Accuracy   Precision   Recall   F1
2026.05   92.1       -           -        91.93
2026.05   90.55      -           -        90.6
2025.12   89.5       -           -        89.4
2025.12   89.5       -           -        89.6
2026.05   89.22      -           -        89.01
2026.05   89.13      -           -        89.52
2025.12   89         -           -        89.4
2025.12   89         -           -        88.2
2026.05   89         -           -        88.71
2026.05   88.96      -           -        87.9
2025.12   88.94      85.32       -        89.21
2025.12   88.67      83.63       -        89.38
2025.01   88.53      90.19       86.47    88.29
2026.05   88.49      -           -        87.93
2025.12   88.4       -           -        87.7
2025.12   88.4       -           -        88.5
2025.12   88.33      91.46       -        88.31
2025.12   88.3       -           -        88.4
2025.12   88.2       -           -        87.4
2026.05   88.19      -           -        88.43
2025.12   88.13      92.06       -        87.55
2026.05   88.03      -           -        87
2025.12   87.9       89.16       -        87.58
2025.12   87.87      90.06       -        87.53
2025.12   87.73      92.49       -        87.01
2025.12   87.6       -           -        87.7
2026.05   87.45      -           -        87.55
2025.12   87.4       84.67       -        88.02
2025.12   87.4       -           -        88
2026.05   87.4       -           -        86.46
2025.12   87.2       -           -        88.2
2025.12   87.13      83.92       -        87.71
2025.01   87.03      88.71       84.87    86.75
2025.12   86.6       -           -        85.3
2026.05   86.47      -           -        86.52
2026.05   86.4       -           -        85.07
2025.12   86.3       -           -        87.4
2025.12   86.3       -           -        87.2
2025.12   86.27      90.66       -        85.48
2025.01   86.2       91.07       80.27    85.33
2025.12   86.17      80.84       -        87.27
2026.05   86.15      -           -        86.34
2025.01   86.15      85.18       87.53    86.34
2025.01   85.82      83.8        88.94    86.29
2025.12   85.7       -           -        86.9
2025.12   85.43      81.77       -        86.23
2025.12   85.17      79.79       -        86.4
2025.12   84.67      79.25       -        85.97
2025.12   84.2       80.9        -        85
2025.12   83.83      78.05       -        85.34
2025.01   83.69      81.84       86.61    84.56
2026.05   83.45      -           -        82.56
2025.01   83.45      87.24       78.36    82.56
2026.05   83.23      -           -        84.83
2025.12   81.9       76.63       -        83.53
2025.01   80.91      77.97       86.16    81.86
2025.12   80.63      76.82       -        81.92
2026.01   61.83      63.13       55.6     59.13
2026.01   61.8       63.27       56.27    59.56
2026.01   60.87      62.79       53.33    57.68
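
The rows that list all four values can be sanity-checked against each other. Assuming the four numbers in a complete row are accuracy, precision, recall, and F1 in that order (an assumption; the page only names Accuracy explicitly), the last value should be the harmonic mean of the second and third. A minimal Python sketch of that check, using three of the 2025.01 rows; the function name `f1_score` and the `rows` list are illustrative, not part of the leaderboard:

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (same units as inputs)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# (precision, recall, listed F1) copied from three 2025.01 rows above.
# Treating columns 2-4 as precision, recall, F1 is an assumption.
rows = [
    (90.19, 86.47, 88.29),
    (88.71, 84.87, 86.75),
    (91.07, 80.27, 85.33),
]

for precision, recall, listed in rows:
    computed = f1_score(precision, recall)
    print(f"P={precision} R={recall} computed F1={computed:.2f} listed F1={listed}")
```

Rows that show a dash in place of a value cannot be checked this way, since one of the two inputs is missing.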