ProvAgent: Threat Detection Based on Identity-Behavior Binding and Multi-Agent Collaborative Attack Investigation
About
Advanced Persistent Threats (APTs) pose critical challenges to modern cybersecurity due to their multi-stage and stealthy nature. While provenance-based detection approaches show promise in capturing causal attack semantics, current threat provenance practices face two paradoxical issues: (1) expert skepticism, where human analysts doubt the capability of traditional detection models to identify complex attacks; and (2) expert dependence, as analysts cannot manually process large-scale raw logs to detect threats without these models. Consequently, collaboration between humans and traditional models remains the prevailing paradigm. However, this renders investigation quality contingent upon human expertise and frequently results in alert fatigue. To address these challenges, we present ProvAgent, a framework that evolves the threat provenance paradigm from human-model collaboration to a novel collaboration between multi-agent systems and traditional models. ProvAgent leverages the speed and cost-efficiency of traditional models for initial anomaly screening over large-scale logs. By enforcing fine-grained identity-behavior consistency via graph contrastive learning, it profiles entities based on specific attributes to generate high-fidelity alerts. With these alerts serving as investigation entry points, ProvAgent achieves in-depth autonomous investigation through a hypothesis-verification multi-agent framework. Evaluations with real-world datasets demonstrate that ProvAgent outperforms six state-of-the-art (SOTA) baselines in anomaly detection. Through automated investigation, ProvAgent reconstructs near-complete attack processes at a minimum cost of \$0.06 per day.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| APT attack investigation | CADETS E3 | Detected IOCs (Initial)3 | 8 | |
| Anomaly Detection | OPTC H501 | True Positives Count (TP)89 | 7 | |
| Attack Detection | DARPA THEIA E3 | True Positives (TP)71 | 7 | |
| Anomaly Detection | OPTC H051 | True Positives49 | 7 | |
| Attack Detection | DARPA E3 CADETS | True Positives (TP)19 | 7 | |
| Attack Detection | DARPA THEIA E5 | TP20 | 7 | |
| Anomaly Detection | OPTC H201 | TP102 | 7 | |
| Attack Detection | DARPA CADETS E5 | TP39 | 7 | |
| APT attack investigation | THEIA E3 | Detected IOCs (Initial)4 | 6 |