Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection
About
In this paper, we present "Graph Feature Preprocessor", a software library for detecting typical money laundering patterns in financial transaction graphs in real time. These patterns are used to produce a rich set of transaction features for downstream machine learning training and inference tasks such as detection of fraudulent financial transactions. We show that our enriched transaction features dramatically improve the prediction accuracy of gradient-boosting-based machine learning models. Our library exploits multicore parallelism, maintains a dynamic in-memory graph, and efficiently mines subgraph patterns in the incoming transaction stream, which enables it to be operated in a streaming manner. Our solution, which combines our Graph Feature Preprocessor and gradient-boosting-based machine learning models, can detect illicit transactions with higher minority-class F1 scores than standard graph neural networks in anti-money laundering and phishing datasets. In addition, the end-to-end throughput rate of our solution executed on a multicore CPU outperforms the graph neural network baselines executed on a powerful V100 GPU. Overall, the combination of high accuracy, a high throughput rate, and low latency of our solution demonstrates the practical value of our library in real-world applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Money laundering detection | IBM AML HI Small | F1 Score (Minority Class)64.77 | 6 | |
| Money laundering detection | IBM AML HI Medium | Minority Class F1 Score65.69 | 6 | |
| Money laundering detection | IBM AML LI Small | Minority Class F1 Score28.25 | 6 | |
| Money laundering detection | IBM AML LI Medium | Minority Class F1 Score31.03 | 6 | |
| Money laundering detection | IBM AML HI Large | Minority Class F1 Score58.03 | 5 | |
| Money laundering detection | IBM AML LI Large | Minority Class F1 Score (%)24.23 | 5 | |
| Phishing Detection | ETH Phishing | Minority Class F1 Score51 | 4 |