| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| ARO | CE-CLIP+ | Accuracy0.804 | 14 | 4d ago | |
| ARO Benchmark Visual Genome Flickr30k MS-COCO (test) | CapPa | VG Attribution89.3 | 11 | 2d ago | |
| Winoground standard (test) | GPT-4o | Text Score75.5 | 7 | 3d ago | |
| SugarCrepe++ | SPARCL | Accuracy66.1 | 5 | 4d ago | |
| Winoground (test) | Diffusion Classifier | Object Score0.461 | 4 | 4d ago | |
| ARO (test) | syn-CLIP | VG-Rel71.4 | 4 | 3d ago | |
| Winoground 1.0 (test) | IAIS | Text Score42.5 | 3 | 3d ago | |
| Winoground clean (no-tag) | CyCLIP | Text Score32.16 | 2 | 3d ago |