Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-Agent Causal Discovery Using Large Language Models

About

Causal discovery aims to identify causal relationships between variables and is a fundamental problem across the sciences. Traditional statistical causal discovery (SCD) methods rely solely on observational data and ignore the contextual information available in metadata, whereas recent LLM-based methods exploit metadata but treat the large language model (LLM) as a single agent, leaving its judgments vulnerable to memorized or biased associations. To address this gap, we introduce MAC (Multi-Agent Causal Discovery Framework), which casts causal discovery as a multi-agent debate coupled with the autonomous selection of an SCD algorithm. MAC combines two complementary modules, bridged by a Meta Fusion mechanism: a Debate-Coding Module (DCM) that grounds an initial graph in data by autonomously selecting and executing the best-suited SCD algorithm, and a Meta-Debate Module (MDM) that refines the graph through an adversarial Affirmative-Negative-Judge debate over the metadata. Across five benchmark datasets and three metrics (F1, SHD, NHD), MAC achieves the best aggregate performance among five statistical and four LLM-based baselines, ranking first on 10 of 15 evaluation points with Gemini-2.0-Flash -- including a perfect reconstruction of the Earthquake graph -- and remains robust across three backbone LLMs.

Hao Duong Le, Xin Xia, Haijie Xu, Chen Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Causal DiscoveryChild
F1 Score65
35
Causal DiscoveryCancer
F1 Score90
34
Causal DiscoverySachs real-world data protein signaling network
SHD21
32
Causal DiscoveryAuto
SHD4.33
30
Causal DiscoveryAutoMPG--
16
Causal DiscoveryDWDClimate--
16
Showing 6 of 6 rows

Other info

Follow for update