Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Few-shot Anomaly Detection in Text with Deviation Learning

About

Most current methods for detecting anomalies in text concentrate on constructing models solely relying on unlabeled data. These models operate on the presumption that no labeled anomalous examples are available, which prevents them from utilizing prior knowledge of anomalies that are typically present in small numbers in many real-world applications. Furthermore, these models prioritize learning feature embeddings rather than optimizing anomaly scores directly, which could lead to suboptimal anomaly scoring and inefficient use of data during the learning process. In this paper, we introduce FATE, a deep few-shot learning-based framework that leverages limited anomaly examples and learns anomaly scores explicitly in an end-to-end method using deviation learning. In this approach, the anomaly scores of normal examples are adjusted to closely resemble reference scores obtained from a prior distribution. Conversely, anomaly samples are forced to have anomalous scores that considerably deviate from the reference score in the upper tail of the prior. Additionally, our model is optimized to learn the distinct behavior of anomalies by utilizing a multi-head self-attention layer and multiple instance learning approaches. Comprehensive experiments on several benchmark datasets demonstrate that our proposed approach attains a new level of state-of-the-art performance.

Anindya Sundar Das, Aravind Ajay, Sriparna Saha, Monowar Bhuyan• 2023

Related benchmarks

TaskDatasetResultRank
Text Anomaly DetectionTAD-SMSSpam
AUROC0.9518
25
Text Anomaly DetectionAGNews
AUPRC73.67
25
Text Anomaly DetectionTAD-HateSpeech
AUROC0.6774
25
Text Anomaly DetectionNLPAD-AGNews
AUROC88.37
25
Text Anomaly DetectionNLPAD-N24News
AUROC87.7
25
Text Anomaly DetectionTAD-OLID
AUROC0.5555
25
Text Anomaly DetectionNLPAD MovieReview
AUROC0.577
25
Text Anomaly DetectionNLPAD-BBCNews
AUROC0.8221
25
Text Anomaly DetectionTAD-CovidFake
AUROC0.8331
25
Text Anomaly DetectionTAD-Liar2
AUROC0.6424
25
Showing 10 of 15 rows

Other info

Follow for update