Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Monte Carlo Permutation Search

About

We propose Monte Carlo Permutation Search (MCPS), a general-purpose Monte Carlo Tree Search (MCTS) algorithm that improves upon the GRAVE algorithm. MCPS is relevant when deep reinforcement learning is not an option or when the computing power available before play is not substantial, such as in General Game Playing. The principle of MCPS is to include in the exploration term of a node the statistics on all the playouts that contain all the moves on the path from the root to the node. We test MCPS on a variety of games: Hex, Go, AtariGo, NoGo and a Wargame. MCPS almost always outperforms GRAVE. We also provide a mathematical derivation of the formulas used for weighting the three sources of statistics. These formulas are an improvement on the GRAVE formula since they no longer use the bias hyperparameter of GRAVE.

Tristan Cazenave• 2025

Related benchmarks

TaskDatasetResultRank
Game PlayingAtariGo
Winrate69
24
Two-player asymmetric strategy game playingWargame
Winrate61.88
20
Board Game Play11x11 Hex
Win rate76.25
7
Board Game PlayingGo 5x5
Winrate55.5
4
Board Game PlayingGo 7x7
Winrate57.75
4
Board Game PlayingGo 9x9
Winrate60.62
4
Board Game PlayingGo 11x11
Winrate63.25
4
Board Game PlayingGo 13x13
Winrate63.75
4
Board Game PlayingGo 15x15
Winrate63
4
Game PlayingNoGo 5x5
Winrate63.88
4
Showing 10 of 20 rows

Other info

Follow for update