SOTOPIA-$\Omega$: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents

About

Despite the abundance of prior social strategies possessed by humans, there remains a paucity of research dedicated to their transfer and integration into social agents. Our proposed SOTOPIA-$\Omega$ framework aims to address and bridge this gap, with a particular focus on enhancing the social capabilities of language agents. This framework dynamically injects multi-step reasoning strategies inspired by negotiation theory and two simple direct strategies into expert agents, thereby automating the construction of a high-quality social dialogue training corpus. Additionally, we introduce the concept of Social Instruction Following (S-IF) and propose two new S-IF evaluation metrics that complement social capability. We demonstrate that several 7B models trained on high-quality corpus not only significantly surpass the expert agent (GPT-4) in achieving social goals but also enhance S-IF performance. Analysis and variant experiments validate the advantages of dynamic construction, which can especially break the agent's prolonged deadlock.

Wenyuan Zhang, Tianyun Liu, Mengxiao Song, Xiaodong Li, Tingwen Liu• 2025

Related benchmarks

Task	Dataset	Result
Social Interaction	SOTOPIA all social scenarios	Goal Score8.35	47
Social Interaction	Sotopia hard	GOAL Score7.31	35
Social Interaction Evaluation	SOTOPIA-Hard (Self-Play)	GOAL Score7.31	24
Social Interaction Evaluation	SOTOPIA (Self-Play)	Goal Score8.35	24
Social Interaction Evaluation	SOTOPIA-Hard GPT-4o-as-Partner	Goal Score6.87	24
Social Interaction Evaluation	SOTOPIA GPT-4o-as-Partner	Goal Score8.15	24
Social Simulation	Sotopia standard (test)	Goal Score8.07	8
Social Simulation	Sotopia hard (test)	Goal Score6.31	8
Social agent strategy optimization	Sotopia subset	Goal Score7.31	3

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord