Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

About

Natural language (NL) has long been the predominant format for human cognition and communication, and by extension, has been similarly pivotal in the development and application of Large Language Models (LLMs). Yet, besides NL, LLMs have seen various non-NL formats during pre-training, such as code and logical expression. NL's status as the optimal format for LLMs, particularly in single-LLM reasoning and multi-agent communication, has not been thoroughly examined. In this work, we challenge the default use of NL by exploring the utility of non-NL formats in these contexts. We show that allowing LLMs to autonomously select the most suitable format before reasoning or communicating leads to a 3.3 to 5.7\% improvement in reasoning efficiency for different LLMs, and up to a 72.7\% reduction in token usage in multi-agent communication, all while maintaining communicative effectiveness. Our comprehensive analysis further reveals that LLMs can devise a format from limited task instructions and that the devised format is effectively transferable across different LLMs. Intriguingly, the structured communication format decided by LLMs exhibits notable parallels with established agent communication languages, suggesting a natural evolution towards efficient, structured communication in agent communication. Our code is released at \url{https://github.com/thunlp/AutoForm}.

Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun• 2024

Related benchmarks

TaskDatasetResultRank
DebateMATH
Accuracy26.1
10
Information ExchangeHotpotQA
F1 Score0.282
10
Information Exchange2WMH QA
F1 Score24.7
10
Information ExchangeTriviaQA
F1 Score60.9
10
DebateGSM8K
Accuracy71
10
DebateARC-C
Accuracy60.2
10
DebateMMLU
Acc43.8
10
Information ExchangeCBT
F1 Score35
10
Showing 8 of 8 rows

Other info

Follow for update