CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society

About

The rapid advancement of chat-based language models has led to remarkable progress in complex task-solving. However, their success heavily relies on human input to guide the conversation, which can be challenging and time-consuming. This paper explores the potential of building scalable techniques to facilitate autonomous cooperation among communicative agents, and provides insight into their "cognitive" processes. To address the challenges of achieving autonomous cooperation, we propose a novel communicative agent framework named role-playing. Our approach involves using inception prompting to guide chat agents toward task completion while maintaining consistency with human intentions. We showcase how role-playing can be used to generate conversational data for studying the behaviors and capabilities of a society of agents, providing a valuable resource for investigating conversational language models. In particular, we conduct comprehensive studies on instruction-following cooperation in multi-agent settings. Our contributions include introducing a novel communicative agent framework, offering a scalable approach for studying the cooperative behaviors and capabilities of multi-agent systems, and open-sourcing our library to support research on communicative agents and beyond: https://github.com/camel-ai/camel.

Guohao Li, Hasan Abed Al Kader Hammoud, Hani Itani, Dmitrii Khizbullin, Bernard Ghanem• 2023

Related benchmarks

Task	Dataset	Result
Mathematical Reasoning	GSM8K	Accuracy45.6	1424
Code Generation	HumanEval	Pass@131.71	1048
Mathematical Reasoning	MATH 500	Accuracy95.7	589
Mathematical Reasoning	MATH	Accuracy22.3	535
Interactive Decision-making	AlfWorld	--	398
Mathematical Reasoning	MATH 500	pass@167.4	239
Code Generation	MBPP	Accuracy (%)78.1	146
Mathematical Reasoning	GSM8K	EM88.6	123
General AI Assistant Task	GAIA (val)	Level 1 Score81.13	97
Science Reasoning	GPQA	Pass@111.11	50

Showing 10 of 63 rows

Other info

Follow for update

@wizwand_team Discord