Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Cross-modal retrieval benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Cross-modal retrieval
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
AEA 553 events (test)
SigLIP 2
Top-5 Recall
54.6
48
18d ago
RSICD (test)
Full-FT GeoRSCLIP
Image-to-Text R@1
21.13
32
1mo ago
Clotho (test)
CART
R@1
46.4
29
1mo ago
RSITMD (test)
HarMA
R@1 (Image-to-Text)
32.74
28
1mo ago
Flickr30k (test)
RCAR
Image-to-text Recall@1
82.3
25
1mo ago
AudioCaps (test)
OmniBind
R@1
59.1
23
1mo ago
MSR-VTT (test)
HoPA
R@1 (V→T)
37.3
19
5d ago
MSCOCO 1K
MURAL-LARGE
Mean Recall (ja)
91.6
16
1mo ago
InstVL video (Global)
InstAP
T2V R@1
94.5
12
9d ago
InstVL video (Instance)
InstAP
T2V Recall@1
60.63
12
9d ago
InstVL img-zero 10K (Global)
InstAP
T2V R@1
83.33
12
9d ago
InstVL img-zero 10K
InstAP
T2V R@1
28.25
12
9d ago
InstVL img-zero 1K (Global)
InstAP
T2V Recall@1
88.7
12
9d ago
InstVL img-zero 1K (Instance)
InstAP
T2V R@1
41.94
12
9d ago
InstVL img 10K (Global)
InstAP
T2V Recall@1
95.77
12
9d ago
InstVL img 10K
InstAP
T2V Recall@1
44.05
12
9d ago
InstVL img 1K (Global)
InstAP
T2V R@1
99.2
12
9d ago
InstVL img 1K Instance
InstAP
T2V R@1
50.25
12
9d ago
MSCOCO 5K (test)
DSMD
R@1 (I2T)
0.621
12
1mo ago
UCM (test)
FBCLM
R@1 (I2T)
28.57
12
1mo ago
MSCOCO (5K)
ALIGN-L2
Mean Recall (ja)
83.4
12
1mo ago
MS-COCO 1K image folds (test)
FedAFD
RSum@1
59.8
8
1mo ago
MS-COCO (test)
FedAFD
R@1 (I2T)
33.98
8
1mo ago
MSR-VTT 3 modal
CLIP
Gap
29
7
1mo ago
Video-QA
SPI-Multimodal
R@10
77.2
6
19d ago
Showing 25 of 37 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs