Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference

About

Large Language Models (LLMs) have been used as experts to infer causal graphs, often by repeatedly applying a pairwise prompt that asks about the causal relationship of each variable pair. However, such experts, including human domain experts, cannot distinguish between direct and indirect effects given a pairwise prompt. Therefore, instead of the graph, we propose that causal order be used as a more stable output interface for utilizing expert knowledge. Even when querying a perfect expert with a pairwise prompt, we show that the inferred graph can have significant errors whereas the causal order is always correct. In practice, however, LLMs are imperfect experts and we find that pairwise prompts lead to multiple cycles. Hence, we propose the triplet method, a novel querying strategy that introduces an auxiliary variable for every variable pair and instructs the LLM to avoid cycles within this triplet. It then uses a voting-based ensemble method that results in higher accuracy and fewer cycles while ensuring cost efficiency. Across multiple real-world graphs, such a triplet-based method yields a more accurate order than the pairwise prompt, using both LLMs and human annotators. The triplet method enhances robustness by repeatedly querying an expert with different auxiliary variables, enabling smaller models like Phi-3 and Llama-3 8B Instruct to surpass GPT-4 with pairwise prompting. For practical usage, we show how the expert-provided causal order from the triplet method can be used to reduce error in downstream graph discovery and effect inference tasks.

Aniket Vashishtha, Abbavaram Gowtham Reddy, Abhinav Kumar, Saketh Bachu, Vineeth N Balasubramanian, Amit Sharma• 2023

Related benchmarks

TaskDatasetResultRank
Causal DiscoveryCovid linear synthetic 1
Dtop100
17
Causal DiscoveryCovid linear synthetic 2
Dtop1.67
17
Causal DiscoveryClimate linear synthetic
Dtop2
17
Causal DiscoverySachs linear synthetic
Dtop5.83
17
Causal DiscoveryGenetic linear synthetic
Dtop0.00e+0
17
Causal DiscoveryNeighbor linear synthetic
Dtop3.83
17
Causal DiscoverySupermarket linear synthetic
Dtop5
17
Causal DiscoveryAsia linear synthetic
Dtop2.75
17
Causal DiscoveryMSU linear synthetic
Dtop1.33
17
Causal DiscoveryCancer linear synthetic
Dtop0.16
17
Showing 10 of 70 rows

Other info

Follow for update