Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search
About
Retrosynthetic planning is a critical task in organic chemistry which identifies a series of reactions that can lead to the synthesis of a target product. The vast number of possible chemical transformations makes the size of the search space very big, and retrosynthetic planning is challenging even for experienced chemists. However, existing methods either require expensive return estimation by rollout with high variance, or optimize for search speed rather than the quality. In this paper, we propose Retro*, a neural-based A*-like algorithm that finds high-quality synthetic routes efficiently. It maintains the search as an AND-OR tree, and learns a neural search bias with off-policy data. Then guided by this neural network, it performs best-first search efficiently during new planning episodes. Experiments on benchmark USPTO datasets show that, our proposed method outperforms existing state-of-the-art with respect to both the success rate and solution quality, while being more efficient at the same time.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Retrosynthetic planning | USPTO | Success Rate86.84 | 50 | |
| Retrosynthetic planning | USPTO-EXT (test) | Success Rate57.89 | 30 | |
| Retrosynthetic planning | USPTO 190 target molecules | Path Length5.76 | 11 | |
| Retrosynthetic planning | USPTO 190 (test) | Success Rate (N=100)55.79 | 10 | |
| Retrosynthetic planning | USPTO route Section 5.1.2 (test) | Success Rate86.84 | 7 | |
| Retrosynthesis | ChEMBL-1000 (test) | Solved Target Molecules762 | 7 | |
| Retrosynthesis | GDB17-1000 (test) | Solved Molecules Count95 | 7 | |
| Retrosynthetic planning | USPTO | Route Length9.71 | 5 | |
| Backward reaction prediction | USPTO reaction (test) | Top-1 Acc44.53 | 4 |