A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration

About

Recent studies show that collaborating multiple large language model (LLM) powered agents is a promising way for task solving. However, current approaches are constrained by using a fixed number of agents and static communication structures. In this work, we propose automatically selecting a team of agents from candidates to collaborate in a dynamic communication structure toward different tasks and domains. Specifically, we build a framework named Dynamic LLM-Powered Agent Network ($\textbf{DyLAN}$) for LLM-powered agent collaboration, operating a two-stage paradigm: (1) Team Optimization and (2) Task Solving. During the first stage, we utilize an $\textit{agent selection}$ algorithm, based on an unsupervised metric called $\textit{Agent Importance Score}$, enabling the selection of best agents according to their contributions in a preliminary trial, oriented to the given task. Then, in the second stage, the selected agents collaborate dynamically according to the query. Empirically, we demonstrate that DyLAN outperforms strong baselines in code generation, decision-making, general reasoning, and arithmetic reasoning tasks with moderate computational cost. On specific subjects in MMLU, selecting a team of agents in the team optimization stage improves accuracy by up to 25.0% in DyLAN.

Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang• 2023

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K	Accuracy89.98	1424
Code Generation	HumanEval	Pass@190.42	1048
Mathematical Reasoning	GSM8K (test)	Accuracy90	954
Mathematical Reasoning	MATH	Accuracy67.7	882
Multi-task Language Understanding	MMLU	Accuracy95.4	881
Language Understanding	MMLU	Accuracy93.2	844
Code Generation	HumanEval (test)	Pass@190.42	701
Mathematical Reasoning	AIME 2024	Accuracy16.7	525
Mathematical Reasoning	MATH 500	Top-1 Accuracy81.66	452
Code Generation	MBPP (test)	Pass@177.3	411

Showing 10 of 130 rows

...

Other info

Follow for update

@wizwand_team Discord