Scaling Large Language Model-based Multi-Agent Collaboration

About

Recent breakthroughs in large language model-driven autonomous agents have revealed that multi-agent collaboration often surpasses each individual through collective reasoning. Inspired by the neural scaling law--increasing neurons enhances performance, this study explores whether the continuous addition of collaborative agents can yield similar benefits. Technically, we utilize directed acyclic graphs to organize agents into a multi-agent collaboration network (MacNet), upon which their interactive reasoning is topologically orchestrated for autonomous task solving. Extensive evaluations reveal that it effectively supports collaboration among over a thousand agents, with irregular topologies outperforming regular ones. We also identify a collaborative scaling law--the overall performance follows a logistic growth pattern as agents scale, with collaborative emergence occurring earlier than traditional neural emergence. We speculate this may be because scaling agents catalyzes their multidimensional considerations during interactive reflection and refinement, thereby producing more comprehensive artifacts. The code is available at https://github.com/OpenBMB/ChatDev/tree/macnet.

Chen Qian, Zihao Xie, YiFei Wang, Wei Liu, Kunlun Zhu, Hanchen Xia, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun• 2024

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K	Accuracy87.95	1398
Code Generation	HumanEval	Pass@184.57	1043
Mathematical Reasoning	MATH	Accuracy72.1	882
Multi-task Language Understanding	MMLU	Accuracy98	881
Language Understanding	MMLU	Accuracy88.1	844
Code Generation	HumanEval (test)	Pass@195.8	612
Multitask Language Understanding	MMLU	Accuracy84.31	520
Mathematical Reasoning	GSM8K	Accuracy83.01	499
Multi-task Language Understanding	MMLU	MMLU Accuracy64.05	442
Code Generation	MBPP (test)	Pass@190.3	405

Showing 10 of 77 rows

...

Other info

Follow for update

@wizwand_team Discord