
Cultural Bias and Cultural Alignment of Large Language Models

About

Culture fundamentally shapes people's reasoning, behavior, and communication. As people increasingly use generative artificial intelligence (AI) to expedite and automate personal and professional tasks, cultural values embedded in AI models may bias people's authentic expression and contribute to the dominance of certain cultures. We conduct a disaggregated evaluation of cultural bias for five widely used large language models (OpenAI's GPT-4o/4-turbo/4/3.5-turbo/3) by comparing the models' responses to nationally representative survey data. All models exhibit cultural values resembling English-speaking and Protestant European countries. We test cultural prompting as a control strategy to increase cultural alignment for each country/territory. For recent models (GPT-4, 4-turbo, 4o), this improves the cultural alignment of the models' output for 71-81% of countries and territories. We suggest using cultural prompting and ongoing evaluation to reduce cultural bias in the output of generative AI.

Yan Tao, Olga Viberg, Ryan S. Baker, Rene F. Kizilcec • 2023

Related benchmarks

Task                  Dataset                   Result            Rank
Binary decision task  CGSS China (test)         Accuracy 66.69    24
Binary decision task  ISD India (test)          Accuracy 74.52    24
Binary decision task  AFRO Africa (test)        Accuracy 55.05    24
Binary decision task  EVS Europe (test)         Accuracy 58.92    24
Binary decision task  GSS United States (test)  Accuracy 0.5326   24
Binary decision task  LAPOP (test)              Accuracy 58.52    24
