Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration

About

Human intelligence thrives on cognitive synergy, where collaboration among different minds yield superior outcomes compared to isolated individuals. In this work, we propose Solo Performance Prompting (SPP), which transforms a single LLM into a cognitive synergist by engaging in multi-turn self-collaboration with multiple personas. A cognitive synergist is an intelligent agent that collaboratively combines multiple minds' strengths and knowledge to enhance problem-solving in complex tasks. By dynamically identifying and simulating different personas based on task inputs, SPP unleashes the potential of cognitive synergy in LLMs. Our in-depth analysis shows that assigning multiple fine-grained personas in LLMs improves problem-solving abilities compared to using a single or fixed number of personas. We evaluate SPP on three challenging tasks: Trivia Creative Writing, Codenames Collaborative, and Logic Grid Puzzle, encompassing both knowledge-intensive and reasoning-intensive types. Unlike previous works, such as Chain-of-Thought, that solely enhance the reasoning abilities in LLMs, experimental results demonstrate that SPP effectively reduces factual hallucination, and maintains strong reasoning capabilities. Additionally, comparative experiments show that cognitive synergy only emerges in GPT-4 and does not appear in less capable models, such as GPT-3.5-turbo and Llama2-13b-chat, which draws an interesting analogy to human development. Code, data, and prompts can be found at: https://github.com/MikeWangWZHL/Solo-Performance-Prompting.git.

Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji• 2023

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMATH
Accuracy51.7
643
Mathematical ReasoningGSM8K
Accuracy (GSM8K)92.8
358
Question AnsweringGPQA
Accuracy35.1
258
Code GenerationHumanEval+--
189
Code GenerationMBPP
Pass@173.3
175
Code GenerationHumanEval
Pass@189.1
108
ReasoningDROP
Score81.62
21
Scientific problem solvingSciBench
Pass@2028.22
17
Mathematical ReasoningGSM8K
Success Rate87.1
16
Medical ReasoningMedAgentsBench Hard Subsets
MEDQA0.45
12
Showing 10 of 22 rows

Other info

Follow for update