Gaussian Process Aggregation for Root-Parallel Monte Carlo Tree Search with Continuous Actions

About

Monte Carlo Tree Search is a cornerstone algorithm for online planning, and its root-parallel variant is widely used when wall-clock time is limited but the best possible performance is desired. In environments with continuous action spaces, how best to aggregate statistics from different threads is an important yet underexplored question. In this work, we introduce a method that uses Gaussian Process Regression to obtain value estimates for promising actions that were not trialed in the environment. We perform a systematic evaluation across 6 different domains, demonstrating that our approach outperforms existing aggregation strategies while requiring a modest increase in inference time.
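To make the aggregation idea concrete, here is a minimal sketch of how root statistics from several threads could be pooled with Gaussian Process regression and then queried at actions that no thread tried. It is an illustration under stated assumptions, not the authors' implementation: the one-dimensional action space, the squared-exponential kernel, the 1/visits noise model, the evenly spaced candidate grid, and the names rbf_kernel, gp_posterior_mean and aggregate_root_statistics are all hypothetical choices made for this example.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=0.3, signal_var=1.0):
    """Squared-exponential kernel between two sets of 1-D actions (assumed kernel)."""
    a = np.asarray(a, dtype=float).reshape(-1, 1)
    b = np.asarray(b, dtype=float).reshape(1, -1)
    return signal_var * np.exp(-0.5 * ((a - b) / length_scale) ** 2)


def gp_posterior_mean(train_x, train_y, noise_var, query_x, length_scale=0.3):
    """Standard GP regression posterior mean with per-point noise variances."""
    train_x = np.asarray(train_x, dtype=float)
    train_y = np.asarray(train_y, dtype=float)
    prior_mean = train_y.mean()  # constant prior mean (assumption)
    K = rbf_kernel(train_x, train_x, length_scale) + np.diag(noise_var)
    K_star = rbf_kernel(query_x, train_x, length_scale)
    alpha = np.linalg.solve(K, train_y - prior_mean)
    return prior_mean + K_star @ alpha


def aggregate_root_statistics(thread_stats, n_candidates=101, length_scale=0.3):
    """Pool root-level statistics from all threads and pick the best candidate.

    thread_stats: one (actions, mean_values, visit_counts) tuple per thread.
    Returns the candidate action with the highest GP posterior mean and that mean.
    """
    actions = np.concatenate([np.asarray(s[0], dtype=float) for s in thread_stats])
    values = np.concatenate([np.asarray(s[1], dtype=float) for s in thread_stats])
    visits = np.concatenate([np.asarray(s[2], dtype=float) for s in thread_stats])

    # Assumption: value estimates backed by fewer visits are treated as noisier.
    noise_var = 1.0 / visits

    # Candidates include actions that were never executed by any thread.
    candidates = np.linspace(actions.min(), actions.max(), n_candidates)
    mean = gp_posterior_mean(actions, values, noise_var, candidates, length_scale)
    best = int(np.argmax(mean))
    return float(candidates[best]), float(mean[best])


if __name__ == "__main__":
    # Two hypothetical threads, each with three root actions and made-up statistics.
    thread_a = ([0.0, 0.4, 0.9], [-0.25, -0.01, -0.16], [50, 50, 50])
    thread_b = ([0.2, 0.6, 1.0], [-0.09, -0.01, -0.25], [50, 50, 50])
    action, value = aggregate_root_statistics([thread_a, thread_b])
    print(f"selected action: {action:.2f}, estimated value: {value:.3f}")
```

The point of the sketch is that the selected action is chosen from the candidate grid rather than only from the actions the threads actually sampled, which is the kind of interpolation the abstract describes. How candidates are proposed and how noise is modeled in the paper may differ.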

Junlin Xiao, Victor-Alexandru Darvariu, Bruno Lacerda, Nick Hawes • 2025

Related benchmarks

Task                                  Dataset            Result     Rank
MCTS Aggregation Strategy Evaluation  Mountain Car       MRR 1      6
MCTS Aggregation Strategy Evaluation  Random Teleporter  MRR 1      6
MCTS Aggregation Strategy Evaluation  Wide Corridor      MRR 1      6
MCTS Aggregation Strategy Evaluation  Narrow Corridor    MRR 1      6
MCTS Aggregation Strategy Evaluation  Lunar Lander       MRR 75     6
MCTS Aggregation Strategy Evaluation  Pendulum           MRR 0.75   6
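The page does not expand "MRR". Assuming it denotes mean reciprocal rank, the short sketch below shows how such a score is computed from a list of 1-based ranks. The function name mean_reciprocal_rank and the example ranks are hypothetical and are not taken from the paper's results.

```python
def mean_reciprocal_rank(ranks):
    """Average of 1/rank over a non-empty list of 1-based ranks."""
    if not ranks:
        raise ValueError("ranks must be non-empty")
    if any(r < 1 for r in ranks):
        raise ValueError("ranks are 1-based and must be >= 1")
    return sum(1.0 / r for r in ranks) / len(ranks)


if __name__ == "__main__":
    # Hypothetical ranks of one method across several comparisons.
    print(mean_reciprocal_rank([1, 1, 1]))
    print(mean_reciprocal_rank([1, 2]))
```

Under this reading, a score of 1 means the method was ranked first in every comparison, and lower scores mean it was sometimes ranked below first.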
