Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Sub-Band Spectral Matching with Localized Score Aggregation for Robust Anomalous Sound Detection

About

Detecting subtle deviations in noisy acoustic environments is central to anomalous sound detection (ASD). A common training-free ASD pipeline temporally pools frame-level representations into a band-preserving feature vector and scores anomalies using a single nearest-neighbor match. However, this global matching can inflate normal-score variance through two effects. First, when normal sounds exhibit band-wise variability, a single global neighbor forces all bands to share the same reference, increasing band-level mismatch. Second, cosine-based matching is energy-coupled, allowing a few high-energy bands to dominate score computation under normal energy fluctuations and further increase variance. We propose BEAM, which stores temporally pooled sub-band vectors in a memory bank, retrieves neighbors per sub-band, and uniformly aggregates scores to reduce normal-score variability and improve discriminability. We further introduce a parameter-free adaptive fusion to better handle diverse temporal dynamics in sub-band responses. Experiments on multiple DCASE Task 2 benchmarks show strong performance without task-specific training, robustness to noise and domain shifts, and complementary gains when combined with encoder fine-tuning.

Phurich Saengthong, Takahiro Shinozaki• 2026

Related benchmarks

TaskDatasetResultRank
Anomalous Sound DetectionDCASE 2020 (dev)
Official Performance Metric87.6
46
Anomalous Sound DetectionDCASE T2 DG sec-eval 2023
HMean73.1
27
Anomalous Sound DetectionDCASE T2 sec-eval 2020
Amean88.5
26
Anomalous Sound DetectionDCASE T2 DG 2023 (sec dev)
HMean66.3
26
Anomalous Sound DetectionDCASE T2 DG 2024 (dev)
HMean62.4
25
Anomalous Sound DetectionDCASE T2 DG sec-eval 2024
HMean63
25
Showing 6 of 6 rows

Other info

Follow for update