Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents

About

Effective research ideation is a critical step for scientific research. However, the exponential increase in scientific literature makes it challenging for researchers to stay current with recent advances and identify meaningful research directions. Recent developments in large language models~(LLMs) suggest a promising avenue for automating the generation of novel research ideas. However, existing methods for idea generation either trivially prompt LLMs or directly expose LLMs to extensive literature without indicating useful information. Inspired by the research process of human researchers, we propose a Chain-of-Ideas~(CoI) agent, an LLM-based agent that organizes relevant literature in a chain structure to effectively mirror the progressive development in a research domain. This organization facilitates LLMs to capture the current advancements in research, thereby enhancing their ideation capabilities. Furthermore, we propose Idea Arena, an evaluation protocol that can comprehensively evaluate idea generation methods from different perspectives, aligning closely with the preferences of human researchers. Experimental results indicate that the CoI agent consistently outperforms other methods and shows comparable quality as humans in research idea generation. Moreover, our CoI agent is budget-friendly, with a minimum cost of \$0.50 to generate a candidate idea and its corresponding experimental design.

Long Li, Weiwen Xu, Jiayan Guo, Ruochen Zhao, Xingxuan Li, Yuqian Yuan, Boqiang Zhang, Yuming Jiang, Yifei Xin, Ronghao Dang, Deli Zhao, Yu Rong, Tian Feng, Lidong Bing• 2024

Related benchmarks

TaskDatasetResultRank
Future-aligned research proposal predictionfuture corpus of ML papers (test)
Hypothesis Score63.8
24
Idea Generation AssessmentAI-Idea-Bench 2025
Motivation Score3.74
12
Scientific Idea GenerationAI-Idea-Bench 2025
Reward Novelty0.59
7
Scientific Idea GenerationIdeaBench
Semantic Similarity0.482
6
Showing 4 of 4 rows

Other info

Follow for update