Few-shot Anomaly Detection in Text with Deviation Learning

About

Most current methods for detecting anomalies in text concentrate on constructing models solely relying on unlabeled data. These models operate on the presumption that no labeled anomalous examples are available, which prevents them from utilizing prior knowledge of anomalies that are typically present in small numbers in many real-world applications. Furthermore, these models prioritize learning feature embeddings rather than optimizing anomaly scores directly, which could lead to suboptimal anomaly scoring and inefficient use of data during the learning process. In this paper, we introduce FATE, a deep few-shot learning-based framework that leverages limited anomaly examples and learns anomaly scores explicitly in an end-to-end method using deviation learning. In this approach, the anomaly scores of normal examples are adjusted to closely resemble reference scores obtained from a prior distribution. Conversely, anomaly samples are forced to have anomalous scores that considerably deviate from the reference score in the upper tail of the prior. Additionally, our model is optimized to learn the distinct behavior of anomalies by utilizing a multi-head self-attention layer and multiple instance learning approaches. Comprehensive experiments on several benchmark datasets demonstrate that our proposed approach attains a new level of state-of-the-art performance.

Anindya Sundar Das, Aravind Ajay, Sriparna Saha, Monowar Bhuyan• 2023

Related benchmarks

Task	Dataset	Result
Text Anomaly Detection	AGNews	AUPRC73.67	49
Text Anomaly Detection	TAD-SMSSpam	AUROC0.9518	25
Text Anomaly Detection	TAD-HateSpeech	AUROC0.6774	25
Text Anomaly Detection	NLPAD-AGNews	AUROC88.37	25
Text Anomaly Detection	NLPAD-N24News	AUROC87.7	25
Text Anomaly Detection	TAD-OLID	AUROC0.5555	25
Text Anomaly Detection	NLPAD MovieReview	AUROC0.577	25
Text Anomaly Detection	NLPAD-BBCNews	AUROC0.8221	25
Text Anomaly Detection	TAD-CovidFake	AUROC0.8331	25
Text Anomaly Detection	TAD-Liar2	AUROC0.6424	25

Showing 10 of 18 rows

Other info

Follow for update

@wizwand_team Discord