AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

About

Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks. However, in real-world scenarios, cooperation among individuals is often required to enhance the efficiency and effectiveness of task accomplishment. Hence, inspired by human group dynamics, we propose a multi-agent framework \framework that can collaboratively and dynamically adjust its composition as a greater-than-the-sum-of-its-parts system. Our experiments demonstrate that \framework framework can effectively deploy multi-agent groups that outperform a single agent. Furthermore, we delve into the emergence of social behaviors among individual agents within a group during collaborative task accomplishment. In view of these behaviors, we discuss some possible strategies to leverage positive ones and mitigate negative ones for improving the collaborative potential of multi-agent groups. Our codes for \framework will soon be released at \url{https://github.com/OpenBMB/AgentVerse}.

Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou• 2023

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K	Accuracy89.91	1424
Code Generation	HumanEval	Pass@196.84	1048
Mathematical Reasoning	MATH	Accuracy55.6	882
Multi-task Language Understanding	MMLU	Accuracy78.36	881
Code Generation	HumanEval (test)	Pass@184.72	701
Multitask Language Understanding	MMLU	Accuracy81.57	568
Mathematical Reasoning	MATH	Accuracy54.5	535
Code Generation	MBPP (test)	--	411
Mathematical Reasoning	SVAMP	Accuracy89.64	403
Code Generation	HumanEval+	--	393

Showing 10 of 111 rows

...

Other info

Follow for update

@wizwand_team Discord