Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adaptive graph-based algorithms for conditional anomaly detection and semi-supervised learning

About

We develop graph-based methods for semi-supervised learning based on label propagation on a data similarity graph. When data is abundant or arrive in a stream, the problems of computation and data storage arise for any graph-based method. We propose a fast approximate online algorithm that solves for the harmonic solution on an approximate graph. We show, both empirically and theoretically, that good behavior can be achieved by collapsing nearby points into a set of local representative points that minimize distortion. Moreover, we regularize the harmonic solution to achieve better stability properties. We also present graph-based methods for detecting conditional anomalies and apply them to the identification of unusual clinical actions in hospitals. Our hypothesis is that patient-management actions that are unusual with respect to the past patients may be due to errors and that it is worthwhile to raise an alert if such a condition is encountered. Conditional anomaly detection extends standard unconditional anomaly framework but also faces new problems known as fringe and isolated points. We devise novel nonparametric graph-based methods to tackle these problems. Our methods rely on graph connectivity analysis and soft harmonic solution. Finally, we conduct an extensive human evaluation study of our conditional anomaly methods by 15 experts in critical care.

Michal Valko• 2026

Related benchmarks

TaskDatasetResultRank
Conditional Anomaly DetectionHousing (UCI ML)
Mean Anomaly Agreement Score (AUC)71.3
15
Conditional Anomaly DetectionAuto MPG UCI ML (2/3, 1/3) train-test split
Mean Anomaly Agreement Score (AUC)72.6
10
Conditional Anomaly DetectionWine Quality (UCI ML) 2/3, 1/3 (train-test)
Mean Anomaly Agreement Score (AUC)74.5
10
Anomaly DetectionDataset D1 synthetic
Mean Anomaly AUROC82.8
5
Anomaly DetectionDataset D2 synthetic
Mean Anomaly AUROC68.9
5
Anomaly DetectionDataset D3 synthetic
Mean Anomaly AUROC67.4
5
Showing 6 of 6 rows

Other info

Follow for update