Cross-modal Retrieval on InstVL img-zero 1K (Global)
Top result: InstAP, 88.7 T2V Recall@1
[Chart: T2V Recall@1 and V2T Recall@1 by date; date shown: Apr 9, 2026]
Updated 9d ago
Evaluation Results
| Method | Details | Date | T2V Recall@1 | V2T Recall@1 |
|---|---|---|---|---|
| InstAP | | 2026.04 | 88.7 | 88.3 |
| VideoPrism | | 2026.04 | 85.7 | 85.8 |
| UMT-L (InstVL; g) | training_corpus=InstVL... | 2026.04 | 85.3 | 86.4 |
| SigLIP | | 2026.04 | 83.9 | 86.5 |
| UMT-L | | 2026.04 | 83.9 | 83.7 |
| OpenCLIP | | 2026.04 | 83.4 | 86.9 |
| UMT-L (InstVL; g+i) | training_corpus=all In... | 2026.04 | 82.4 | 84.3 |
| CLIP4Clip | | 2026.04 | 78.2 | 81.7 |
| ViCLIP | | 2026.04 | 77.8 | 77.6 |
| Coca | | 2026.04 | 67.4 | 70.5 |
| MCQ | | 2026.04 | 58.9 | 62.7 |
| CLIP-ViP | | 2026.04 | 55.6 | 73.2 |
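
The page does not say how T2V Recall@1 and V2T Recall@1 are computed. As a rough illustration only, the sketch below uses a common definition: the percentage of queries whose ground-truth match is the top-ranked candidate. It assumes a one-to-one pairing in which text i matches video i, and that V2T is scored on the transposed similarity matrix. The names `recall_at_k` and `transpose` and the toy scores are hypothetical and are not taken from the benchmark.

```python
from typing import List, Sequence


def recall_at_k(sim: Sequence[Sequence[float]], k: int = 1) -> float:
    """Percentage of queries whose ground-truth item ranks in the top k.

    Assumes sim[i][j] scores query i against candidate j, and that the
    ground-truth match for query i is candidate i (one-to-one pairing).
    """
    hits = 0
    for i, row in enumerate(sim):
        # Rank = number of candidates scoring strictly higher than the true
        # match, so ties are resolved in the query's favour (an assumption).
        rank = sum(1 for score in row if score > row[i])
        if rank < k:
            hits += 1
    return 100.0 * hits / len(sim)


def transpose(sim: Sequence[Sequence[float]]) -> List[List[float]]:
    """Swap queries and candidates (text-to-video becomes video-to-text)."""
    return [list(col) for col in zip(*sim)]


# Toy 3x3 text-to-video similarity matrix (made-up numbers).
t2v_sim = [
    [0.9, 0.1, 0.2],
    [0.3, 0.4, 0.5],  # text 1 scores video 2 above its true match, video 1
    [0.2, 0.1, 0.7],
]

t2v_r1 = recall_at_k(t2v_sim, k=1)             # 2 of 3 texts retrieve their video
v2t_r1 = recall_at_k(transpose(t2v_sim), k=1)  # all 3 videos retrieve their text
```

On this toy matrix the two directions disagree, which is consistent with rows in the table such as CLIP-ViP, where T2V and V2T Recall@1 differ noticeably.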