MEMTO: Memory-guided Transformer for Multivariate Time Series Anomaly Detection
About
Detecting anomalies in real-world multivariate time series data is challenging due to complex temporal dependencies and inter-variable correlations. Recently, reconstruction-based deep models have been widely used to solve the problem. However, these methods still suffer from an over-generalization issue and fail to deliver consistently high performance. To address this issue, we propose the MEMTO, a memory-guided Transformer using a reconstruction-based approach. It is designed to incorporate a novel memory module that can learn the degree to which each memory item should be updated in response to the input data. To stabilize the training procedure, we use a two-phase training paradigm which involves using K-means clustering for initializing memory items. Additionally, we introduce a bi-dimensional deviation-based detection criterion that calculates anomaly scores considering both input space and latent space. We evaluate our proposed method on five real-world datasets from diverse domains, and it achieves an average anomaly detection F1-score of 95.74%, significantly outperforming the previous state-of-the-art methods. We also conduct extensive experiments to empirically validate the effectiveness of our proposed model's key components.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Anomaly Detection | SMD | F1 Score93.54 | 217 | |
| Anomaly Detection | SWaT | F1 Score95.83 | 174 | |
| Anomaly Detection | PSM | F1 Score68 | 76 | |
| Anomaly Detection | MSL | Precision53 | 39 | |
| Anomaly Detection | PSM | Visual ROC50 | 35 | |
| Time Series Anomaly Detection | SMAP | Precision93.76 | 32 | |
| Time Series Anomaly Detection | MSL | VUS-ROC0.5 | 32 | |
| Time Series Anomaly Detection | SWaT (test) | Affiliation Precision56.47 | 25 | |
| Time Series Anomaly Detection | SMAP (test) | Affiliation Precision50.12 | 25 | |
| Anomaly Detection | SMD (test) | Precision (Aff)49.69 | 25 |