Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Cross-modal Retrieval (Audio to Image) on iNaturalist Species-level Retrieval seen
Loading...
50.3
Top-1 Accuracy
BioVITA
-2.0016
11.5767
25.155
38.7333
Mar 25, 2026
Top-1 Accuracy
Top-5 Accuracy
Updated 23d ago
Evaluation Results
Method
Method
Links
Top-1 Accuracy
Top-5 Accuracy
BioVITA
Training Stage=Stage2
2026.03
50.3
77.4
BioVITA
Training Stage=Stage1
2026.03
47.8
72.8
TaxaBind
2026.03
13.3
35.8
ImageBind
2026.03
1.8
8.7
Random
2026.03
0.01
0.05
Feedback
Search any
task
Search any
task