Pi-transformer: A prior-informed dual-attention model for multivariate time-series anomaly detection

About

Anomalies in multivariate time series often arise from temporal context and cross-channel coordination rather than isolated outliers. We present Pi-Transformer (Prior-Informed Transformer), a transformer with two attention pathways: data-driven series attention and a smoothly evolving prior attention that encodes temporal invariants such as scale-related self-similarity and phase synchrony. The prior provides an amplitude-insensitive temporal reference that calibrates reconstruction error. During training, we pair a reconstruction objective with a divergence term that encourages agreement between the two attentions while keeping them meaningfully distinct. The prior is regularised to evolve smoothly and is lightly distilled towards dataset-level statistics. At inference, the model combines an alignment-weighted reconstruction signal (Energy) with a mismatch signal that highlights timing and phase disruptions, and fuses them into a single score for detection. Across five benchmarks (SMD, MSL, SMAP, SWaT, and PSM), Pi-Transformer achieves state-of-the-art or highly competitive F1, with particular strength on timing and phase-breaking anomalies. Case analyses show complementary behaviour of the two streams and interpretable detections around regime changes. Embedding prior attention into transformer scoring yields a calibrated and robust approach to anomaly detection in complex multivariate systems.

Sepehr Maleki, Negar Pourmoazemi• 2025

Related benchmarks

Task	Dataset	Result
Anomaly Detection	SMD	F1 Score91.23	375
Anomaly Detection	SWaT	F1 Score96.82	348
Anomaly Detection	PSM	F1 Score98.08	157
Anomaly Detection	SMAP	F1 Score97.02	114
Anomaly Detection	MSL	Precision96.24	95

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord