SOTOPIA-$\Omega$: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents
About
Despite the abundance of prior social strategies possessed by humans, there remains a paucity of research dedicated to their transfer and integration into social agents. Our proposed SOTOPIA-$\Omega$ framework aims to address and bridge this gap, with a particular focus on enhancing the social capabilities of language agents. This framework dynamically injects multi-step reasoning strategies inspired by negotiation theory and two simple direct strategies into expert agents, thereby automating the construction of a high-quality social dialogue training corpus. Additionally, we introduce the concept of Social Instruction Following (S-IF) and propose two new S-IF evaluation metrics that complement social capability. We demonstrate that several 7B models trained on high-quality corpus not only significantly surpass the expert agent (GPT-4) in achieving social goals but also enhance S-IF performance. Analysis and variant experiments validate the advantages of dynamic construction, which can especially break the agent's prolonged deadlock.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Social Interaction Evaluation | SOTOPIA-Hard (Self-Play) | GOAL Score7.31 | 24 | |
| Social Interaction Evaluation | SOTOPIA (Self-Play) | Goal Score8.35 | 24 | |
| Social Interaction Evaluation | SOTOPIA-Hard GPT-4o-as-Partner | Goal Score6.87 | 24 | |
| Social Interaction Evaluation | SOTOPIA GPT-4o-as-Partner | Goal Score8.15 | 24 | |
| Social Simulation | Sotopia standard (test) | Goal Score8.07 | 8 | |
| Social Simulation | Sotopia hard (test) | Goal Score6.31 | 8 |