| Dataset Name | SOTA Method | Metric | Value | Trend | Last updated |
|---|---|---|---|---|---|
| GQA POPE Popular | MESA | Accuracy | 86.07 | 49 | 8d ago |
| GQA POPE Random | Scalpel | Accuracy (GQA POPE) | 89.93 | 42 | 8d ago |
| GQA Adversarial | MESA | Accuracy | 82.73 | 40 | 8d ago |
| A-OKVQA (Adversarial split) | SchroMind | Accuracy | 79.1 | 27 | 1mo ago |
| GQA POPE (Adversarial) | MESA | Accuracy | 82.73 | 19 | 8d ago |
| COCO POPE Random | Scalpel | Accuracy | 90.67 | 17 | 1mo ago |
| A-OKVQA (Random split) | SchroMind | Accuracy | 90.83 | 12 | 1mo ago |
| OKVQA POPE Popular | Scalpel | Accuracy | 85 | 11 | 1mo ago |
| OKVQA POPE Adversarial | | POPE Score (Zh) | 79.97 | 6 | 1mo ago |
| OKVQA POPE Random | CLAIM | Accuracy (Zh) | 86.03 | 6 | 1mo ago |
| COCO POPE (Adversarial) | CLAIM | Score (Zh) | 83.27 | 6 | 1mo ago |
| MS COCO Popular split | Scalpel | Accuracy | 87.87 | 5 | 1mo ago |