AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

About

AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents that can converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes that employ combinations of LLMs, human inputs, and tools. Using AutoGen, developers can also flexibly define agent interaction behaviors. Both natural language and computer code can be used to program flexible conversation patterns for different applications. AutoGen serves as a generic infrastructure to build diverse applications of various complexities and LLM capacities. Empirical studies demonstrate the effectiveness of the framework in many example applications, with domains including mathematics, coding, question answering, operations research, online decision-making, and entertainment.
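The conversation pattern described above can be pictured with a minimal plain-Python sketch. It is an illustration only and does not use the AutoGen API: `ToyAgent`, `run_chat`, `solver_reply`, and `checker_reply` are hypothetical names, and the reply functions are deterministic stand-ins for what would be an LLM, a human, or a tool in a real application.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Hypothetical toy types for illustration only; this is NOT the AutoGen API.
Message = Tuple[str, str]  # (speaker name, text)
ReplyFn = Callable[[List[Message]], Optional[str]]


@dataclass
class ToyAgent:
    name: str
    reply_fn: ReplyFn  # stand-in for an LLM, a human, or a tool

    def reply(self, history: List[Message]) -> Optional[str]:
        return self.reply_fn(history)


def run_chat(initiator: ToyAgent, responder: ToyAgent,
             message: str, max_turns: int = 6) -> List[Message]:
    """Alternate between two agents until one returns None or max_turns is reached."""
    history: List[Message] = [(initiator.name, message)]
    speaker, listener = responder, initiator
    for _ in range(max_turns):
        text = speaker.reply(history)
        if text is None:  # a reply of None ends the conversation
            break
        history.append((speaker.name, text))
        speaker, listener = listener, speaker
    return history


def solver_reply(history: List[Message]) -> Optional[str]:
    # The first attempt has a deliberate off-by-one bug; the retry fixes it.
    attempts = sum(1 for name, _ in history if name == "solver")
    answer = sum(range(1, 10)) if attempts == 0 else sum(range(1, 11))
    return str(answer)


def checker_reply(history: List[Message]) -> Optional[str]:
    # Tool-like verification of the last message; None means "accepted, stop".
    last = history[-1][1]
    return None if last == "55" else f"{last} is wrong, try again"


solver = ToyAgent("solver", solver_reply)
checker = ToyAgent("checker", checker_reply)
history = run_chat(checker, solver, "What is the sum of 1 through 10?")
for name, text in history:
    print(f"{name}: {text}")
```

In this sketch the interaction behavior lives entirely in the reply functions and the loop in `run_chat`, which is one simple way to think about programming a conversation pattern in code; the framework itself supports richer patterns than this two-agent loop.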

Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Jiale Liu, Ahmed Hassan Awadallah, Ryen W White, Doug Burger, Chi Wang • 2023

Related benchmarks

Task	Dataset	Metric	Result
Mathematical Reasoning	GSM8K	Accuracy	87.8	1424
Code Generation	HumanEval	Pass@1	83.5	1048
Code Generation	HumanEval (test)	Pass@1	90.4	701
Multitask Language Understanding	MMLU	Accuracy	82.34	568
Mathematical Reasoning	MATH	Accuracy	69.5	535
Code Generation	MBPP (test)	Pass@1	92.3	411
Mathematical Reasoning	AIME 2024	Accuracy	26.67	394
Mathematical Reasoning	AIME 2025	Accuracy	20	378
Mathematical Reasoning	GSM8K	Accuracy (GSM8K)	94.54	358
Arithmetic Reasoning	MultiArith	Accuracy	95.05	324

Showing 10 of 199 rows
