Cultural Bias and Cultural Alignment of Large Language Models

About

Culture fundamentally shapes people's reasoning, behavior, and communication. As people increasingly use generative artificial intelligence (AI) to expedite and automate personal and professional tasks, cultural values embedded in AI models may bias people's authentic expression and contribute to the dominance of certain cultures. We conduct a disaggregated evaluation of cultural bias for five widely used large language models (OpenAI's GPT-4o/4-turbo/4/3.5-turbo/3) by comparing the models' responses to nationally representative survey data. All models exhibit cultural values resembling English-speaking and Protestant European countries. We test cultural prompting as a control strategy to increase cultural alignment for each country/territory. For recent models (GPT-4, 4-turbo, 4o), this improves the cultural alignment of the models' output for 71-81% of countries and territories. We suggest using cultural prompting and ongoing evaluation to reduce cultural bias in the output of generative AI.

Yan Tao, Olga Viberg, Ryan S. Baker, Rene F. Kizilcec• 2023

Related benchmarks

Task	Dataset	Result
Binary decision task	CGSS China (test)	Accuracy66.69	24
Binary decision task	ISD India (test)	Accuracy74.52	24
Binary decision task	AFRO Africa (test)	Accuracy55.05	24
Binary decision task	EVS Europe (test)	Accuracy58.92	24
Binary decision task	GSS United States (test)	Accuracy0.5326	24
Binary decision task	LAPOP (test)	Accuracy58.52	24

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord