Language Agents as Optimizable Graphs

About

Various human-designed prompt engineering techniques have been proposed to improve problem solvers based on Large Language Models (LLMs), yielding many disparate code bases. We unify these approaches by describing LLM-based agents as computational graphs. The nodes implement functions to process multimodal data or query LLMs, and the edges describe the information flow between operations. Graphs can be recursively combined into larger composite graphs representing hierarchies of inter-agent collaboration (where edges connect operations of different agents). Our novel automatic graph optimizers (1) refine node-level LLM prompts (node optimization) and (2) improve agent orchestration by changing graph connectivity (edge optimization). Experiments demonstrate that our framework can be used to efficiently develop, integrate, and automatically improve various LLM agents. The code can be found at https://github.com/metauto-ai/gptswarm.
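To make the graph view concrete, here is a minimal sketch of an agent as a computational graph. It is not the GPTSwarm API: the `Graph` class, its methods, and the `fake_llm` stub are hypothetical names introduced only for illustration. Nodes are plain functions, edges record which outputs feed which node, and removing an edge stands in for the kind of connectivity change that edge optimization would search over.

```python
from graphlib import TopologicalSorter
from typing import Callable, Dict, List


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM query; a real node would call a model."""
    return f"<answer to: {prompt}>"


class Graph:
    """Illustrative computational graph: nodes are functions, edges carry outputs."""

    def __init__(self) -> None:
        self.nodes: Dict[str, Callable[[List[str]], str]] = {}
        self.preds: Dict[str, List[str]] = {}  # node name -> predecessor names

    def add_node(self, name: str, fn: Callable[[List[str]], str]) -> None:
        self.nodes[name] = fn
        self.preds.setdefault(name, [])

    def add_edge(self, src: str, dst: str) -> None:
        self.preds[dst].append(src)

    def remove_edge(self, src: str, dst: str) -> None:
        self.preds[dst].remove(src)

    def run(self, task: str) -> Dict[str, str]:
        """Execute nodes in topological order; source nodes receive the task."""
        outputs: Dict[str, str] = {}
        for name in TopologicalSorter(self.preds).static_order():
            inputs = [outputs[p] for p in self.preds[name]] or [task]
            outputs[name] = self.nodes[name](inputs)
        return outputs


g = Graph()
g.add_node("draft", lambda xs: fake_llm("Solve: " + xs[0]))
g.add_node("critic", lambda xs: fake_llm("Critique: " + xs[0]))
g.add_node("final", lambda xs: " | ".join(xs))
g.add_edge("draft", "critic")
g.add_edge("draft", "final")
g.add_edge("critic", "final")

result = g.run("2 + 2")

# Changing connectivity: the final node no longer sees the critic's output.
g.remove_edge("critic", "final")
result_pruned = g.run("2 + 2")
```

In this sketch, rewriting the prompt string inside a node corresponds loosely to node optimization, while adding or removing edges corresponds to edge optimization. How the actual optimizers choose those changes is not shown here.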

Mingchen Zhuge, Wenyi Wang, Louis Kirsch, Francesco Faccio, Dmitrii Khizbullin, Jürgen Schmidhuber • 2024

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K (test)	Accuracy 89.1	954
Code Generation	HumanEval (test)	--	701
Multi-task Language Understanding	MMLU	MMLU Accuracy 63.5	456
Code Generation	MBPP (test)	--	411
Multi-hop Question Answering	HotpotQA (test)	F1 73.2	334
Reasoning	MMLU-Pro	Accuracy 82.86	264
Multitask Language Understanding	MMLU	--	263
Code Generation	HumanEval	Accuracy 93.7	224
Mathematical Reasoning	GSM8K	--	220
Mathematics	AIME25	Accuracy 36.67	103

Showing 10 of 18 rows
