Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Object Hallucination Probing on GQA Adversarial

82.73Accuracy

MESA

66.786870.925975.06579.2041Dec 22, 2025Jan 9, 2026Jan 27, 2026Feb 14, 2026Mar 4, 2026Mar 22, 2026Apr 9, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.04
82.7381.93
2026.02
81.7682.31
2026.04
81.6180.75
2026.02
8182.09
2026.04
8180.06
2026.04
80.4778.44
2026.04
80.0379.75
2026.04
80.0180.75
2026.04
79.681.35
2026.04
79.1381.07
2026.04
79.180.41
2025.12
78.480.5
2026.04
78.3379.78
2026.02
77.480.11
2026.02
77.480.11
2025.12
77.278.3
2025.12
76.478.1
2026.02
76.0978.78
2026.02
76.0978.78
2026.04
76.0978.78
2026.04
75.976.89
2025.12
75.876.5
2025.12
75.474.7
2025.12
75.376.1
2025.12
75.277.7
2026.02
75.0876.06
2026.02
75.0876.06
2026.02
7578.71
2026.02
7578.71
2025.12
74.875.8
2026.04
73.6165.59
2026.04
73.5365.53
2025.12
73.576
2025.12
73.176.1
2025.12
72.875.5
2026.04
72.176.37
2025.12
70.474.1
2026.02
69.275.58
2026.02
68.1374.78
2026.04
67.474.31