| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Flickr30K | EVA-CLIP-E | R@194.9 | 531 | 3d ago | |
| Flickr30k (test) | BLIP-2 | Recall@189.7 | 445 | 1mo ago | |
| Flickr30K 1K (test) | ERNIE-ViL 2.0 | R@193.3 | 432 | 3d ago | |
| MSCOCO 5K (test) | MAP | R@160.9 | 308 | 19d ago | |
| MS-COCO 5K (test) | BLIP-2 ViT-g | R@168.3 | 244 | 16d ago | |
| COCO | EVA-CLIP-8B | Recall@170.2 | 156 | 4d ago | |
| MS-COCO | L2RM-SGRAF | R@165.7 | 151 | 20d ago | |
| MSCOCO | BLIP | R@164.3 | 123 | 22d ago | |
| MSCOCO 1K (test) | AltCLIP | R@16,390 | 118 | 19d ago | |
| Flickr30K-CN | R2D2 | R@184.4 | 99 | 1mo ago | |
| CUHK-PEDES (test) | CADA-L | Recall@178.37 | 96 | 1mo ago | |
| DCI | Qwen3-VL-Embedding | R@179.7 | 79 | 19d ago | |
| RSITMD (test) | GeoRSCLIP-FT | R@125.04 | 77 | 1mo ago | |
| MS-COCO (test) | K-LITE | R@12,208 | 72 | 29d ago | |
| Flickr30k (1K) | ALIGN | R@184.9 | 63 | 6d ago | |
| Flickr30k (val) | ITO | R@167.1 | 51 | 1mo ago | |
| MSCOCO (val) | ITO | R@138.97 | 51 | 1mo ago | |
| MS-COCO 1K | CRCL | R@165.1 | 51 | 1mo ago | |
| RSICD (test) | GeoRSCLIP-FT | R@115.59 | 50 | 1mo ago | |
| COCO-CN | M2-Encoder | R@178.7 | 49 | 1mo ago | |
| CC152K | L2RM-SGRAF | R@142.8 | 48 | 1mo ago | |
| NWPU (test) | RRSITR | R@113.62 | 44 | 18d ago | |
| COCO 5K (test) | R@164.3 | 43 | 3d ago | ||
| MSCOCO (5K) | AMoE | R@153.98 | 42 | 1mo ago | |
| Urban-1K | LamRA | R@198.8 | 40 | 13d ago |