Monte Carlo Permutation Search
About
We propose Monte Carlo Permutation Search (MCPS), a general-purpose Monte Carlo Tree Search (MCTS) algorithm that improves upon the GRAVE algorithm. MCPS is relevant when deep reinforcement learning is not an option or when the computing power available before play is not substantial, such as in General Game Playing. The principle of MCPS is to include in the exploration term of a node the statistics on all the playouts that contain all the moves on the path from the root to the node. We test MCPS on a variety of games: Hex, Go, AtariGo, NoGo and a Wargame. MCPS almost always outperforms GRAVE. We also provide a mathematical derivation of the formulas used for weighting the three sources of statistics. These formulas are an improvement on the GRAVE formula since they no longer use the bias hyperparameter of GRAVE.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Game Playing | AtariGo | Winrate69 | 24 | |
| Two-player asymmetric strategy game playing | Wargame | Winrate61.88 | 20 | |
| Board Game Play | 11x11 Hex | Win rate76.25 | 7 | |
| Board Game Playing | Go 5x5 | Winrate55.5 | 4 | |
| Board Game Playing | Go 7x7 | Winrate57.75 | 4 | |
| Board Game Playing | Go 9x9 | Winrate60.62 | 4 | |
| Board Game Playing | Go 11x11 | Winrate63.25 | 4 | |
| Board Game Playing | Go 13x13 | Winrate63.75 | 4 | |
| Board Game Playing | Go 15x15 | Winrate63 | 4 | |
| Game Playing | NoGo 5x5 | Winrate63.88 | 4 |