Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

About

AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents that can converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes that employ combinations of LLMs, human inputs, and tools. Using AutoGen, developers can also flexibly define agent interaction behaviors. Both natural language and computer code can be used to program flexible conversation patterns for different applications. AutoGen serves as a generic infrastructure to build diverse applications of various complexities and LLM capacities. Empirical studies demonstrate the effectiveness of the framework in many example applications, with domains ranging from mathematics, coding, question answering, operations research, online decision-making, entertainment, etc.

Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Jiale Liu, Ahmed Hassan Awadallah, Ryen W White, Doug Burger, Chi Wang• 2023

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy87.8
1362
Code GenerationHumanEval
Pass@183.5
1036
Mathematical ReasoningMATH
Accuracy69.5
535
Code GenerationHumanEval (test)
Pass@190.4
506
Mathematical ReasoningGSM8K
Accuracy (GSM8K)94.54
358
Code GenerationMBPP (test)
Pass@192.3
298
Arithmetic ReasoningMultiArith
Accuracy95.05
229
Code GenerationMBPP
Accuracy (%)85.3
146
Mathematical ReasoningMATH 500
Accuracy72.8
119
Code GenerationHumanEval
Accuracy96.95
99
Showing 10 of 117 rows
...

Other info

Follow for update