| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| COCO (val) | DINOv2-ARL | R@160.1 | 43 | 4d ago | |
| RSICD (test) | mR10.43 | 43 | 3d ago | ||
| COCO (test) | BLIP_FUSECAP | Recall@197.2 | 37 | 3d ago | |
| MSCOCO (test) | CCLM | EN Retrieval Score95.6 | 28 | 4d ago | |
| RSICD | GeoMELT | Mean Recall40.72 | 26 | 2d ago | |
| Flickr30K | MDCS-SGA | R@1100 | 25 | 4d ago | |
| COCO | ACED-F2 | Retrieval Score58.3 | 21 | 3d ago | |
| Flickr30K (test) | Uni-Perceiver-L + Conditional MoEs | R@1 (Img->Txt)94.1 | 21 | 2d ago | |
| MSCOCO 5K | HarmoCLIP | I-T Score69.78 | 18 | 2d ago | |
| MSCOCO | CLIP | MR66.7 | 18 | 4d ago | |
| Flickr30K | CLIP | MR90.1 | 18 | 4d ago | |
| CBVS-20K (test) | UniCLIP | R@150.3 | 16 | 4d ago | |
| MIMIC 5x200 | LGDEA | Precision@156.31 | 15 | 2d ago | |
| RSITMD | RemoteCLIP (ViT-L) | R@1 (I2T)28.76 | 11 | 4d ago | |
| Retrieval | Avg Recall74.78 | 11 | 4d ago | ||
| Quilt | PathGen | R@132.46 | 10 | 4d ago | |
| SlideBench | PathFLIP (Ours) | Recall@115.13 | 10 | 3d ago | |
| MS-COCO (test) | BLIP-2 | Rt@10.8532 | 10 | 3d ago | |
| CC152K | SREM | Sum372.2 | 10 | 3d ago | |
| Flickr30K 1K (test) | ALBEF (14M) | IR@185.6 | 10 | 4d ago | |
| Fashion-Gen | Kaleido-BERT | Rank@127.99 | 10 | 3d ago | |
| Fashion-IQ (test) | Avg Recall@(10, 50)48.53 | 10 | 4d ago | ||
| UCM | RemoteCLIP (ViT-B) | I2T R@120.48 | 9 | 4d ago | |
| MIMIC-CXR 5x200 | MedUnifier | mAP@160.7 | 9 | 3d ago | |
| MS COCO | SigLIP | Recall@152.7 | 9 | 3d ago |