Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CausalTAD: Injecting Causal Knowledge into Large Language Models for Tabular Anomaly Detection

About

Detecting anomalies in tabular data is critical for many real-world applications, such as credit card fraud detection. With the rapid advancements in large language models (LLMs), state-of-the-art performance in tabular anomaly detection has been achieved by converting tabular data into text and fine-tuning LLMs. However, these methods randomly order columns during conversion, without considering the causal relationships between them, which is crucial for accurately detecting anomalies. In this paper, we present CausalTaD, a method that injects causal knowledge into LLMs for tabular anomaly detection. We first identify the causal relationships between columns and reorder them to align with these causal relationships. This reordering can be modeled as a linear ordering problem. Since each column contributes differently to the causal relationships, we further propose a reweighting strategy to assign different weights to different columns to enhance this effect. Experiments across more than 30 datasets demonstrate that our method consistently outperforms the current state-of-the-art methods. The code for CausalTAD is available at https://github.com/350234/CausalTAD.

Ruiqi Wang, Ruikang Liu, Runyu Chen, Haoxiang Suo, Zhiyi Peng, Zhuo Tang, Changjian Chen• 2026

Related benchmarks

TaskDatasetResultRank
Anomaly DetectionWBC
ROCAUC0.969
87
Tabular Anomaly Detectionpima
AUC ROC0.667
53
Tabular Anomaly DetectionBreastW
AUC-ROC0.997
50
Tabular Anomaly Detectionionosphere
AUC-ROC25.2
50
Tabular Anomaly DetectionWine
AUC-ROC1
39
Tabular Anomaly DetectionOptdigits
AUC-ROC0.972
39
Tabular Anomaly Detectionpendigits
AUC-ROC89.2
39
Tabular Anomaly DetectionVertebral
AUC-ROC63.1
33
Tabular Anomaly DetectionLymphography
AUC-ROC1
30
Tabular Anomaly DetectionSeismic
AUC-ROC0.791
30
Showing 10 of 40 rows

Other info

Follow for update