Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

About

Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks. However, in real-world scenarios, cooperation among individuals is often required to enhance the efficiency and effectiveness of task accomplishment. Hence, inspired by human group dynamics, we propose a multi-agent framework \framework that can collaboratively and dynamically adjust its composition as a greater-than-the-sum-of-its-parts system. Our experiments demonstrate that \framework framework can effectively deploy multi-agent groups that outperform a single agent. Furthermore, we delve into the emergence of social behaviors among individual agents within a group during collaborative task accomplishment. In view of these behaviors, we discuss some possible strategies to leverage positive ones and mitigate negative ones for improving the collaborative potential of multi-agent groups. Our codes for \framework will soon be released at \url{https://github.com/OpenBMB/AgentVerse}.

Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou• 2023

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy89.91
983
Code GenerationHumanEval
Pass@196.84
850
Multi-task Language UnderstandingMMLU
Accuracy78.36
842
Mathematical ReasoningMATH
Accuracy55.6
643
Mathematical ReasoningMATH
Accuracy54.5
535
Mathematical ReasoningSVAMP
Accuracy89.64
368
Mathematical ReasoningGSM8K
Accuracy (GSM8K)93.4
358
Question AnsweringGPQA
Accuracy40.2
258
Code GenerationHumanEval+--
189
Code GenerationMBPP
Accuracy (%)82.4
146
Showing 10 of 27 rows

Other info

Follow for update