Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking

About

Parallel thinking expands exploration breadth, complementing the deep exploration of information-seeking (IS) agents to further enhance problem-solving capability. However, conventional parallel thinking faces two key challenges in this setting: inefficiency from repeatedly rolling out from scratch, and difficulty in integrating long-horizon reasoning trajectories during answer generation, as limited context capacity prevents full consideration of the reasoning process. To address these issues, we propose ParallelMuse, a two-stage paradigm designed for deep IS agents. The first stage, Functionality-Specified Partial Rollout, partitions generated sequences into functional regions and performs uncertainty-guided path reuse and branching to enhance exploration efficiency. The second stage, Compressed Reasoning Aggregation, exploits reasoning redundancy to losslessly compress information relevant to answer derivation and synthesize a coherent final answer. Experiments across multiple open-source agents and benchmarks demonstrate up to 62% performance improvement with a 10--30% reduction in exploratory token consumption.

Baixuan Li, Dingchu Zhang, Jialong Wu, Wenbiao Yin, Zhengwei Tao, Yida Zhao, Liwen Zhang, Haiyang Shen, Runnan Fang, Pengjun Xie, Jingren Zhou, Yong Jiang• 2025

Related benchmarks

TaskDatasetResultRank
Web BrowsingBrowsecomp
Accuracy70
52
Logical reasoningHLE
Accuracy0.5806
46
Medical ReasoningHealthBench Hard
Accuracy23
41
BrowseComp-PlusBrowseComp+
Accuracy73.33
25
HLEHLE
Accuracy50.32
25
Long-horizon agentic taskBrowseComp+
Performance76.67
24
Long-horizon agentic taskHLE
Performance58.06
24
Long-horizon agentic taskBrowsecomp
Performance70
24
DeepSearchQADeepSearchQA
Accuracy64
19
Question AnsweringDeepSearchQA
Accuracy61.33
19
Showing 10 of 19 rows

Other info

Follow for update