Predicting Retrosynthetic Reaction using Self-Corrected Transformer Neural Networks
About
Synthesis planning is the process of recursively decomposing target molecules into available precursors. Computer-aided retrosynthesis can potentially assist chemists in designing synthetic routes, but at present it is cumbersome and provides results of dissatisfactory quality. In this study, we develop a template-free self-corrected retrosynthesis predictor (SCROP) to perform a retrosynthesis prediction task trained by using the Transformer neural network architecture. In the method, the retrosynthesis planning is converted as a machine translation problem between molecular linear notations of reactants and the products. Coupled with a neural network-based syntax corrector, our method achieves an accuracy of 59.0% on a standard benchmark dataset, which increases >21% over other deep learning methods, and >6% over template-based methods. More importantly, our method shows an accuracy 1.7 times higher than other state-of-the-art methods for compounds not appearing in the training set.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Retrosynthesis | USPTO-50k Reaction type unknown (test) | Top-1 Accuracy43.7 | 59 | |
| Retrosynthesis | USPTO-50k Reaction type known (test) | Top-1 Accuracy59 | 50 | |
| Retrosynthesis | USPTO-50K | Top-1 Accuracy59 | 33 | |
| Retrosynthesis prediction | USPTO-50K | Top-1 Acc (Unknown)43.7 | 22 | |
| Retrosynthesis (reaction class not given) | USPTO-50k (test) | Top-1 Acc43.7 | 14 | |
| Retrosynthesis prediction | USPTO-50k (40/5/5) | Top-1 Accuracy0.437 | 8 | |
| Retro-reaction prediction | Reaction Prediction Dataset | Top-1 Accuracy43.7 | 5 |