| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| POPE | MIRROR | Accuracy94.42 | 935 | 2d ago | |
| POPE Adversarial offline | F1 Score68.96 | 84 | 3d ago | ||
| POPE Popular offline | OPERA | F1 Score84.43 | 84 | 3d ago | |
| POPE Random offline | F1 Score73.6 | 84 | 3d ago | ||
| MS-COCO (POPE Adversarial) | AVISC | Accuracy87.62 | 80 | 3d ago | |
| MS-COCO POPE (Popular) | AVISC | Accuracy90.76 | 76 | 3d ago | |
| MSCOCO 2014 (val) | CHAIRs54.6 | 55 | 3d ago | ||
| MS-COCO POPE Random | AVISC | Accuracy92.36 | 55 | 3d ago | |
| CHAIR | Dola | CS Score57 | 49 | 3d ago | |
| POPE (test) | Accuracy90.6 | 44 | 2d ago | ||
| A-OKVQA POPE (Popular) | CoFi-Dec | Accuracy87.71 | 36 | 3d ago | |
| A-OKVQA POPE (Random) | HDD | Accuracy89.5 | 36 | 3d ago | |
| POPE GQA Popular | HDD | Accuracy86.8 | 30 | 3d ago | |
| MSCOCO CHAIR | LLaVA-1.5-7B + Vissink | CHAIR_S52.4 | 27 | 3d ago | |
| MSCOCO | RVCD | Accuracy88.54 | 21 | 3d ago | |
| POPE MSCOCO (val) | DeepSeek-VL-7B | F1 Score88.1 | 21 | 3d ago | |
| POPE GQA (Adversarial) | CoFi-Dec | Accuracy0.8169 | 18 | 3d ago | |
| CHAIR MS COCO based (test) | CHAIRs56.2 | 18 | 3d ago | ||
| POPE 28 | LAF-7B | Accuracy88.9 | 18 | 3d ago | |
| MME Existence (test) | LogicCheckGPT | Accuracy96.67 | 18 | 3d ago | |
| POPE GQA | VLI | Accuracy86.47 | 16 | 3d ago | |
| POPE A-OKVQA | VLI | Accuracy89.23 | 16 | 3d ago | |
| POPE MSCOCO | VLI | Accuracy92.58 | 16 | 3d ago | |
| CHAIR (val) | CHAIRs Score58.8 | 15 | 3d ago | ||
| COCO | M3ID | CHAIR Score (Scene)56.2 | 15 | 3d ago |