Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

State Space Models for Bioacoustics: A Comparative Evaluation with Transformers

About

In this study, we evaluate the efficacy of the Mamba architecture bioacoustics by introducing BioMamba, a Mamba-based audio representation model for wildlife sounds. We pre-train a BioMamba using self-supervised learning on a large audio corpus and evaluate it on the BEANS benchmark across diverse classification and detection tasks. Compared to the state-of-the-art Transformer-based model (AVES), BioMamba achieves comparable performance while significantly reducing VRAM consumption. Our results demonstrate Mamba's potential as a computationally efficient alternative for real-world environmental monitoring.

Chengyu Tang, Sanjeev Baskiyar• 2025

Related benchmarks

TaskDatasetResultRank
DetectionBeans
dcase0.426
7
ClassificationBeans
Accuracy (bats)72.5
7
Showing 2 of 2 rows

Other info

Follow for update