Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Cross-modal Retrieval (Audio to Text) on iNaturalist Species-level Retrieval seen
Loading...
63.7
Top-1 Accuracy
BioVITA
-2.5376
14.6587
31.855
49.0513
Mar 25, 2026
Top-1 Accuracy
Top-5 Accuracy
Updated 23d ago
Evaluation Results
Method
Method
Links
Top-1 Accuracy
Top-5 Accuracy
BioVITA
Training Stage=Stage2
2026.03
63.7
83.8
BioVITA
Training Stage=Stage1
2026.03
60.3
80
TaxaBind
2026.03
9.6
28.1
ImageBind
2026.03
2
8.9
CLAP
2026.03
0.7
4.7
Random
2026.03
0.01
0.05
Feedback
Search any
task
Search any
task