Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Social diversity and social preferences in mixed-motive reinforcement learning

About

Recent research on reinforcement learning in pure-conflict and pure-common interest games has emphasized the importance of population heterogeneity. In contrast, studies of reinforcement learning in mixed-motive games have primarily leveraged homogeneous approaches. Given the defining characteristic of mixed-motive games--the imperfect correlation of incentives between group members--we study the effect of population heterogeneity on mixed-motive reinforcement learning. We draw on interdependence theory from social psychology and imbue reinforcement learning agents with Social Value Orientation (SVO), a flexible formalization of preferences over group outcome distributions. We subsequently explore the effects of diversity in SVO on populations of reinforcement learning agents in two mixed-motive Markov games. We demonstrate that heterogeneity in SVO generates meaningful and complex behavioral variation among agents similar to that suggested by interdependence theory. Empirical results in these mixed-motive dilemmas suggest agents trained in heterogeneous populations develop particularly generalized, high-performing policies relative to those trained in homogeneous populations.

Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Du\'e\~nez-Guzm\'an, Edward Hughes, Joel Z. Leibo• 2020

Related benchmarks

TaskDatasetResultRank
Multi-agent Social Dilemma Equality EvaluationHarvest
Equality Score (E)97.4
9
Multi-agent Social Dilemma Equality EvaluationCleanup
Equality Score (E)90.2
9
Social Dilemma CooperationTwo-Player Public Goods Game (test)
r11.104
7
Showing 3 of 3 rows

Other info

Follow for update