SciPIP: An LLM-based Scientific Paper Idea Proposer

About

The rapid advancement of large language models (LLMs) has opened new possibilities for automating the proposal of innovative scientific ideas. This process involves two key phases: literature retrieval and idea generation. However, existing approaches often fall short due to their reliance on keyword-based search tools during the retrieval phase, which neglects crucial semantic information and frequently results in incomplete retrieval outcomes. Similarly, in the idea generation phase, current methodologies tend to depend solely on the internal knowledge of LLMs or metadata from retrieved papers, thereby overlooking significant valuable insights contained within the full texts. To address these limitations, we introduce SciPIP, an innovative framework designed to enhance the LLM-based proposal of scientific ideas through improvements in both literature retrieval and idea generation. Our approach begins with the construction of a comprehensive literature database that supports advanced retrieval based not only on keywords but also on semantics and citation relationships. This is complemented by the introduction of a multi-granularity retrieval algorithm aimed at ensuring more thorough and exhaustive retrieval results. For the idea generation phase, we propose a dual-path framework that effectively integrates both the content of retrieved papers and the extensive internal knowledge of LLMs. This integration significantly boosts the novelty, feasibility, and practical value of proposed ideas. Our experiments, conducted across various domains such as natural language processing and computer vision, demonstrate SciPIP's capability to generate a multitude of innovative and useful ideas. These findings underscore SciPIP's potential as a valuable tool for researchers seeking to advance their fields with groundbreaking concepts.

Wenxiao Wang, Lihui Gu, Liye Zhang, Yunxiang Luo, Yi Dai, Chen Shen, Liang Xie, Binbin Lin, Xiaofei He, Jieping Ye• 2024

Related benchmarks

Task	Dataset	Result
Subjective evaluation of research ideas	100 Research Ideas 10 Benchmark Topics	Novelty Score4.87	15
Research Idea Generation	Ten benchmark topics (100 generated research ideas)	Average Wins2.7	15
Idea Generation Assessment	AI-Idea-Bench 2025	Motivation Score3.68	12
Scientific Idea Generation	NeurIPS 2025	--	12
Research Proposal Generation	AI Idea Bench (AIIB) held-out 2025	AIIB Score6.93	7
Research Proposal Generation	LiveIdeaBench held-out	Live Score6.78	7
Research Proposal Generation	AIIB and LiveIdeaBench Combined	Average Score6.86	7
Scientific Idea Generation	AI-Idea-Bench 2025	Reward Novelty0.5	7
Scientific Proposal Generation	AI Idea Bench and LiveIdeaBench 24 held-out benchmark groups 2025	Novelty2.71	7
Scientific ideation	NeurIPS 2025 (CV)	Novelty6.16	6

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord