Deep Generic Representations for Domain-Generalized Anomalous Sound Detection
About
Developing a reliable anomalous sound detection (ASD) system requires robustness to noise, adaptation to domain shifts, and effective performance with limited training data. Current leading methods rely on extensive labeled data for each target machine type to train feature extractors using Outlier-Exposure (OE) techniques, yet their performance on the target domain remains sub-optimal. In this paper, we present GenRep, which combines generic feature representations from a robust, large-scale pre-trained feature extractor with kNN for domain-generalized ASD, without any fine-tuning. GenRep incorporates MemMixup, a simple approach for augmenting the target memory bank using the nearest source samples, paired with a domain normalization technique that addresses the imbalance between source and target domains. Without requiring labeled data, GenRep outperforms the best OE-based approach, reaching an Official Score of 73.79% on the DCASE2023T2 Eval set, and remains robust in limited-data scenarios. The code is available as open source.
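The pipeline described above (a kNN memory bank, MemMixup augmentation of the target bank, and per-domain score normalization) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes embeddings have already been extracted by some pre-trained model and L2-normalized, and the function names (`mem_mixup`, `domain_normalized_score`), the mixing weight `lam`, the neighbor counts, the use of cosine distance, the z-score normalization, and the minimum over domains are all assumptions chosen for illustration.

```python
import numpy as np

def l2_normalize(x):
    # Scale each row to unit length so dot products are cosine similarities.
    return x / np.linalg.norm(x, axis=1, keepdims=True)

def knn_distance(queries, bank, k=1, exclude_self=False):
    # Mean cosine distance from each query to its k nearest bank entries.
    dists = np.sort(1.0 - queries @ bank.T, axis=1)
    if exclude_self:
        # When the bank is queried against itself, drop the zero self-match.
        dists = dists[:, 1:]
    return dists[:, :k].mean(axis=1)

def mem_mixup(target, source, n_neighbors=2, lam=0.9):
    # Assumed form of MemMixup: interpolate each target embedding with its
    # nearest source embeddings and append the results to the target bank.
    idx = np.argsort(-(target @ source.T), axis=1)[:, :n_neighbors]
    mixed = lam * target[:, None, :] + (1.0 - lam) * source[idx]
    mixed = mixed.reshape(-1, target.shape[1])
    return l2_normalize(np.vstack([target, mixed]))

def domain_normalized_score(queries, source_bank, target_bank, k=1):
    # Assumed form of domain normalization: z-score the kNN distance of each
    # query using the within-bank distance statistics of that domain, then
    # keep the smaller (more "normal") of the two standardized scores.
    scores = []
    for bank in (source_bank, target_bank):
        ref = knn_distance(bank, bank, k=k, exclude_self=True)
        d = knn_distance(queries, bank, k=k)
        scores.append((d - ref.mean()) / (ref.std() + 1e-8))
    return np.minimum(scores[0], scores[1])
```

In use, one would build `target_bank = mem_mixup(target, source)` from the few target-domain normal clips and the many source-domain ones, then call `domain_normalized_score(test_embeddings, source, target_bank)`; higher scores indicate clips that are far from the normal samples of both domains.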
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Anomalous Sound Detection | DCASE 2020 (dev) | Official Performance Metric | 82.1 | 46 |
| Anomalous Sound Detection | DCASE T2 DG sec-eval 2023 | HMean | 73.8 | 27 |
| Anomalous Sound Detection | DCASE T2 DG 2023 (sec dev) | HMean | 65 | 26 |
| Anomalous Sound Detection | DCASE T2 sec-eval 2020 | Amean | 82.7 | 26 |
| Anomalous Sound Detection | DCASE T2 DG sec-eval 2024 | HMean | 62.4 | 25 |
| Anomalous Sound Detection | DCASE T2 DG 2024 (dev) | HMean | 58.1 | 25 |
| Anomalous Sound Detection | DCASE 2023 (dev) | Performance Metric | 73.8 | 17 |