Cross-modal Retrieval (Image to Audio) on iNaturalist Species-level Retrieval seen
Best result: BioVITA, 57.5 Top-1 Accuracy
[Chart: Top-1 Accuracy and Top-5 Accuracy by date; date shown: Mar 25, 2026]
Updated 23d ago
Evaluation Results
Method     Training Stage  Date     Top-1 Accuracy  Top-5 Accuracy
---------  --------------  -------  --------------  --------------
BioVITA    Stage2          2026.03  57.5            85.6
BioVITA    Stage1          2026.03  48.6            78.8
TaxaBind   -               2026.03  16.2            41.4
ImageBind  -               2026.03  2.9             12.2
Random     -               2026.03  0.01            0.05