Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment

About

Despite the notable advancements of existing prompting methods, such as In-Context Learning and Chain-of-Thought for Large Language Models (LLMs), they still face challenges related to various biases. Traditional debiasing methods primarily focus on the model training stage, including approaches based on data augmentation and reweighting, yet they struggle with the complex biases inherent in LLMs. To address such limitations, the causal relationship behind the prompting methods is uncovered using a structural causal model, and a novel causal prompting method based on front-door adjustment is proposed to effectively mitigate LLMs biases. In specific, causal intervention is achieved by designing the prompts without accessing the parameters and logits of LLMs. The chain-of-thought generated by LLM is employed as the mediator variable and the causal effect between input prompts and output answers is calculated through front-door adjustment to mitigate model biases. Moreover, to accurately represent the chain-of-thoughts and estimate the causal effects, contrastive learning is used to fine-tune the encoder of chain-of-thought by aligning its space with that of the LLM. Experimental results show that the proposed causal prompting approach achieves excellent performance across seven natural language processing datasets on both open-source and closed-source LLMs.

Congzhi Zhang, Linhai Zhang, Jialong Wu, Yulan He, Deyu Zhou• 2024

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy84.18
983
Mathematical ReasoningMATH
Accuracy48.36
643
Fact VerificationFEVER
Accuracy0.7713
67
Multi-hop Question AnsweringHotpotQA
F172.51
48
Question AnsweringStrQA
Accuracy73.97
24
Question AnsweringComQA
Accuracy78.03
18
Showing 6 of 6 rows

Other info

Follow for update