Mitigating Cross-Lingual Cultural Inconsistencies in LLMs via Consensus-Driven Preference Optimisation

About

Despite their impressive capabilities, multilingual large language models (MLLMs) frequently exhibit inconsistent behaviour when the prompt's language changes. While such adaptation is generally desirable, it becomes a critical failure when a user's identity is explicitly defined. For instance, given a fixed British persona and an ambiguous everyday knowledge query about literature, the prompt's language frequently overwrites the system persona -- yielding Shakespeare in English but Cervantes in Spanish. To robustly quantify this Cross-lingual Cultural Inconsistency, we introduce Singleton Fleiss's $\kappa_S$, a metric mathematically resilient to hallucinations. For mitigation, we propose Cross-lingual Cultural Consistent Preference Optimisation (C-3PO), a consensus-driven alignment framework. C-3PO achieves up to a 0.13-point absolute increase in $\kappa_S$ over unaligned models, consistently outperforming strong prompting and representation steering baselines whilst preserving explicit user identities, cultural neutrality and intrinsic cultural knowledge. Empirical evaluations demonstrate this inconsistency disproportionately affects lower-resource languages like Indonesian and Persian. Finally, early decoding of intermediate layers reveals that MLLMs implicitly personalise outputs towards the prompt language's stereotypical culture as forward-pass representations stabilise.

Lucas Resck, Isabelle Augenstein, Anna Korhonen • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Cross-lingual Cultural Consistency | BLEnD All 8 Languages | Max Sigma 0.017 | 15 |
| Cross-lingual Cultural Consistency | BLEnD Higher-Resource | Max Sigma 0.025 | 15 |
| Cross-lingual Cultural Consistency | BLEnD Lower-Resource | Max Sigma 0.02 | 15 |
| Cross-lingual Cultural Consistency | BLEnD Indo-European | Max Sigma 0.021 | 15 |
| Cross-lingual Cultural Consistency | BLEnD Non-Indo-European | Max Sigma 0.022 | 15 |