Improving the Distributional Alignment of LLMs using Supervision
About
The ability to accurately align LLMs with diverse population groups on subjective questions would have great value. In this work, we show that adding simple supervision can more consistently improve the alignment of LLM-generated distributions with diverse population groups, as measured across three datasets spanning public health, public opinion, and values and beliefs. Beyond evaluating average alignment, we also report how alignment varies across specific groups. Our broad findings provide insights into the distributional alignment of LLM generations with diverse populations. By conducting evaluation over many LLMs and prompting strategies, we provide a benchmark to stimulate future research.
Gauri Kambhatla, Sanjana Gautam, Angela Zhang, Alex Liu, Ravi Srinivasan, Junyi Jessy Li, Matthew Lease• 2025
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Opinion Alignment | WGM | Opinion Alignment89.8 | 60 | |
| Opinion Alignment | OQA | Opinion Alignment91.6 | 8 | |
| Opinion Alignment | WVS | Opinion Alignment82 | 7 | |
| Opinion Alignment | Average WGM, OQA, WVS | Opinion Alignment86.2 | 3 |
Showing 4 of 4 rows