VQA Hallucination on POPE MS COCO v1 (test)

Random Accuracy: 86

Robust mPLUG-Owl-7B

Mar 20, 2024
Updated 1mo ago

Evaluation Results

Method  Links
2024.03  86    -     73    -     65    -     74.7  -
2024.03  85.2  38.4  83.9  38    82.3  40.5  83.8  39
2024.03  85.2  53.4  79.1  57.5  73.2  67.5  79.2  51.1
2024.03  84.8  39.6  83.3  41.8  80.7  44    82.9  41.8
2024.03  84.3  55.6  77    61.6  71.3  68.2  77.5  61.8
2024.03  81.2  65.6  73.9  67.3  68.2  75.4  74.4  69.4
2024.03  76    67.7  69.3  73.3  65.8  77.6  70.3  72.9
2024.03  74.8  75.1  61.8  86.7  58.1  90.1  64.9  84
2024.03  73    -     67    -     62    -     74.7  -
2024.03  67.9  80.6  63.8  83.2  59.8  87.3  63.8  83.7
2024.03  52    -     57    -     60    -     67.3  -