Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration

About

Recent studies show that collaborating multiple large language model (LLM) powered agents is a promising way for task solving. However, current approaches are constrained by using a fixed number of agents and static communication structures. In this work, we propose automatically selecting a team of agents from candidates to collaborate in a dynamic communication structure toward different tasks and domains. Specifically, we build a framework named Dynamic LLM-Powered Agent Network ($\textbf{DyLAN}$) for LLM-powered agent collaboration, operating a two-stage paradigm: (1) Team Optimization and (2) Task Solving. During the first stage, we utilize an $\textit{agent selection}$ algorithm, based on an unsupervised metric called $\textit{Agent Importance Score}$, enabling the selection of best agents according to their contributions in a preliminary trial, oriented to the given task. Then, in the second stage, the selected agents collaborate dynamically according to the query. Empirically, we demonstrate that DyLAN outperforms strong baselines in code generation, decision-making, general reasoning, and arithmetic reasoning tasks with moderate computational cost. On specific subjects in MMLU, selecting a team of agents in the team optimization stage improves accuracy by up to 25.0% in DyLAN.

Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang• 2023

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy89.98
983
Multi-task Language UnderstandingMMLU
Accuracy95.4
842
Language UnderstandingMMLU
Accuracy93.2
756
Mathematical ReasoningMATH
Accuracy67.7
643
Mathematical ReasoningSVAMP
Accuracy88.48
368
Mathematical ReasoningGSM8K
Accuracy (GSM8K)91.2
358
General AI Assistant TasksGAIA
Accuracy43.7
266
Question AnsweringGPQA
Accuracy53.8
258
Code GenerationCode
Accuracy83.9
242
Automated PlanningPDDL
Accuracy50.4
233
Showing 10 of 34 rows

Other info

Follow for update