Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Prompting Techniques for Reducing Social Bias in LLMs through System 1 and System 2 Cognitive Processes

About

Dual process theory posits that human cognition arises via two systems. System 1, which is a quick, emotional, and intuitive process, which is subject to cognitive biases, and System 2, is a slow, onerous, and deliberate process. Prior research in LLMs found that using chain-of-thought (CoT) prompting in LLMs, which has been often compared to System 2 reasoning, can lead to reduced gender bias. Along these lines, we investigate the relationship between bias, CoT prompting, a direct debiasing, and dual process theory modeling in LLMs. We compare zero-shot CoT, debiasing, and dual process theory-based prompting strategies on two bias datasets spanning nine different social bias categories. We incorporate human and machine personas to determine whether LLM modeling of the effects of dual process theory exist independent of explicit persona models or are tied to the LLM's modeling of human-like generation. We find that a human persona, debiasing, System 2, and CoT prompting all tend to reduce social biases in LLMs, though the best combination of features depends on the exact model and bias category -- resulting in up to a 33 percent drop in stereotypical judgments by an LLM.

Mahammed Kamruzzaman, Gene Louis Kim• 2024

Related benchmarks

TaskDatasetResultRank
Evaluation-based Bias ReductionBias Reduction Benchmark (Evaluation)
Bias Reduction Performance99.8
35
Memory-based Bias ReductionBias Reduction Benchmark Memory
Bias Reduction Performance50.4
35
Memory Fidelity EvaluationMemory-based Experiment Seen Features
P-Diff0.03
32
In-hospital mortality predictionMIMIC-IV (test)
Accuracy75.3
10
Showing 4 of 4 rows

Other info

Follow for update