
Whose Opinions Do Language Models Reflect?

About

Language models (LMs) are increasingly being used in open-ended contexts, where the opinions they reflect in response to subjective queries can have a profound impact, both on user satisfaction and on shaping the views of society at large. In this work, we put forth a quantitative framework to investigate the opinions reflected by LMs -- by leveraging high-quality public opinion polls and their associated human responses. Using this framework, we create OpinionQA, a new dataset for evaluating the alignment of LM opinions with those of 60 US demographic groups over topics ranging from abortion to automation. Across topics, we find substantial misalignment between the views reflected by current LMs and those of US demographic groups: on par with the Democrat-Republican divide on climate change. Notably, this misalignment persists even after explicitly steering the LMs towards particular demographic groups. Our analysis not only confirms prior observations about the left-leaning tendencies of some human feedback-tuned LMs, but also surfaces groups whose opinions are poorly reflected by current LMs (e.g., 65+ and widowed individuals). Our code and data are available at https://github.com/tatsu-lab/opinions_qa.
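The framework compares an LM's distribution over answer choices for a survey question against the distribution of a demographic group's human responses. As a minimal sketch of one such distributional comparison, here is the Jensen-Shannon divergence (a metric that also appears in the benchmark results below); the answer distributions are hypothetical, and the paper itself may use a different alignment metric:

```python
import numpy as np

def js_divergence(p, q, eps=1e-12):
    """Jensen-Shannon divergence (base 2) between two discrete
    distributions; symmetric and bounded in [0, 1]."""
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    p, q = p / p.sum(), q / q.sum()
    m = 0.5 * (p + q)
    kl = lambda a, b: float(np.sum(a * np.log2(a / b)))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Hypothetical answer distributions over a 4-option survey question:
# the fraction of LM samples vs. the fraction of a demographic group's
# respondents choosing each option.
lm_dist    = [0.70, 0.20, 0.05, 0.05]
group_dist = [0.30, 0.30, 0.20, 0.20]
print(round(js_divergence(lm_dist, group_dist), 3))
```

A divergence of 0 means the LM's answer distribution exactly matches the group's; larger values indicate greater misalignment.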

Shibani Santurkar, Esin Durmus, Faisal Ladhak, Cinoo Lee, Percy Liang, Tatsunori Hashimoto • 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| Public Opinion Simulation | WVS | KL (Beliefs & Life) | 0.8621 | 28 |
| Public Opinion Simulation | WVS (World Values Survey) (test) | Beliefs & Life JS Divergence | 0.5491 | 28 |
| Dictator Game | Iyengar and Westwood | Dem Δ | 1.66 | 13 |
| Trust Game | Carlin and Love | Dem Δ | 1.07 | 13 |
| Trust Game | Whitt 2021 | Dem Δ | 2.21 | 13 |
| Dictator Game | Whitt | Dem Δ | 2.21 | 13 |
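The "Dem Δ" entries above report a partisan gap in economic-game behavior. Assuming (this is a reading of the leaderboard, not a definition it provides) that Dem Δ is the gap between a model's mean allocation to Democrat-identified versus Republican-identified partners, it can be sketched as:

```python
import statistics

def dem_delta(to_democrat, to_republican):
    """Hypothetical 'Dem Δ': absolute gap between mean allocations made
    to Democrat- vs. Republican-identified partners across trials."""
    return abs(statistics.mean(to_democrat) - statistics.mean(to_republican))

# Hypothetical dictator-game allocations (out of 10 tokens) per trial.
print(dem_delta([7, 8, 6, 7], [5, 5, 6, 4]))  # gap of 2.0
```

A Dem Δ of 0 would mean the model treats partners identically regardless of stated partisanship; the table's nonzero values indicate partisan-dependent behavior.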
