| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| A-OKVQA POPE (Adversarial) | SIRA | Accuracy0.8363 | 42 | 2d ago | |
| MSCOCO | mPLUG-Owl | CHAIR Instance Score30.2 | 38 | 3mo ago | |
| COCO 500 images | DoLa | CHAIR Score (Scene)74.4 | 36 | 6d ago | |
| AMBER | HALC | CHAIR_I16.2 | 35 | 2mo ago | |
| MSCOCO (500 random samples) | Cs53 | 25 | 1mo ago | ||
| CHAIR (test) | DoLa | CS (%)58.2 | 9 | 1mo ago | |
| MMHalbench | FLB | Average Score2.23 | 3 | 2mo ago | |
| ShareGPT4V | CHAIR-S Score46.8 | 3 | 3mo ago | ||
| MiniGPT-4 | CHAIR Score (S)45.9 | 3 | 3mo ago |