Monte Carlo Permutation Search

About

We propose Monte Carlo Permutation Search (MCPS), a general-purpose Monte Carlo Tree Search (MCTS) algorithm that improves upon the GRAVE algorithm. MCPS is relevant when deep reinforcement learning is not an option or when the computing power available before play is not substantial, such as in General Game Playing. The principle of MCPS is to include in the exploration term of a node the statistics on all the playouts that contain all the moves on the path from the root to the node. We test MCPS on a variety of games: Hex, Go, AtariGo, NoGo and a Wargame. MCPS almost always outperforms GRAVE. We also provide a mathematical derivation of the formulas used for weighting the three sources of statistics. These formulas are an improvement on the GRAVE formula since they no longer use the bias hyperparameter of GRAVE.

Tristan Cazenave• 2025

Related benchmarks

Task	Dataset	Result
Game Playing	AtariGo	Winrate69	24
Two-player asymmetric strategy game playing	Wargame	Winrate61.88	20
Board Game Play	11x11 Hex	Win rate76.25	7
Board Game Playing	Go 5x5	Winrate55.5	4
Board Game Playing	Go 7x7	Winrate57.75	4
Board Game Playing	Go 9x9	Winrate60.62	4
Board Game Playing	Go 11x11	Winrate63.25	4
Board Game Playing	Go 13x13	Winrate63.75	4
Board Game Playing	Go 15x15	Winrate63	4
Game Playing	NoGo 5x5	Winrate63.88	4

Showing 10 of 20 rows

Other info

Follow for update

@wizwand_team Discord