Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SOTOPIA-$\Omega$: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents

About

Despite the abundance of prior social strategies possessed by humans, there remains a paucity of research dedicated to their transfer and integration into social agents. Our proposed SOTOPIA-$\Omega$ framework aims to address and bridge this gap, with a particular focus on enhancing the social capabilities of language agents. This framework dynamically injects multi-step reasoning strategies inspired by negotiation theory and two simple direct strategies into expert agents, thereby automating the construction of a high-quality social dialogue training corpus. Additionally, we introduce the concept of Social Instruction Following (S-IF) and propose two new S-IF evaluation metrics that complement social capability. We demonstrate that several 7B models trained on high-quality corpus not only significantly surpass the expert agent (GPT-4) in achieving social goals but also enhance S-IF performance. Analysis and variant experiments validate the advantages of dynamic construction, which can especially break the agent's prolonged deadlock.

Wenyuan Zhang, Tianyun Liu, Mengxiao Song, Xiaodong Li, Tingwen Liu• 2025

Related benchmarks

TaskDatasetResultRank
Social Interaction EvaluationSOTOPIA-Hard (Self-Play)
GOAL Score7.31
24
Social Interaction EvaluationSOTOPIA (Self-Play)
Goal Score8.35
24
Social Interaction EvaluationSOTOPIA-Hard GPT-4o-as-Partner
Goal Score6.87
24
Social Interaction EvaluationSOTOPIA GPT-4o-as-Partner
Goal Score8.15
24
Social SimulationSotopia standard (test)
Goal Score8.07
8
Social SimulationSotopia hard (test)
Goal Score6.31
8
Showing 6 of 6 rows

Other info

Follow for update