Deep Generic Representations for Domain-Generalized Anomalous Sound Detection

About

Developing a reliable anomalous sound detection (ASD) system requires robustness to noise, adaptation to domain shifts, and effective performance with limited training data. Current leading methods rely on extensive labeled data for each target machine type to train feature extractors using Outlier-Exposure (OE) techniques, yet their performance on the target domain remains sub-optimal. In this paper, we present \textit{GenRep}, which utilizes generic feature representations from a robust, large-scale pre-trained feature extractor combined with kNN for domain-generalized ASD, without the need for fine-tuning. \textit{GenRep} incorporates MemMixup, a simple approach for augmenting the target memory bank using nearest source samples, paired with a domain normalization technique to address the imbalance between source and target domains. \textit{GenRep} outperforms the best OE-based approach without a need for labeled data with an Official Score of 73.79\% on the DCASE2023T2 Eval set and demonstrates robustness under limited data scenarios. The code is available open-source.

Phurich Saengthong, Takahiro Shinozaki• 2024

Related benchmarks

Task	Dataset	Result
Anomalous Sound Detection	DCASE 2020 (dev)	Official Performance Metric82.1	46
Anomalous Sound Detection	DCASE T2 DG sec-eval 2023	HMean73.8	27
Anomalous Sound Detection	DCASE T2 DG 2023 (sec dev)	HMean65	26
Anomalous Sound Detection	DCASE T2 sec-eval 2020	Amean82.7	26
Anomalous Sound Detection	DCASE T2 DG sec-eval 2024	HMean62.4	25
Anomalous Sound Detection	DCASE T2 DG 2024 (dev)	HMean58.1	25
Anomalous Sound Detection	DCASE 2023 (dev)	Performance Metric73.8	17

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord