Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning
About
Retrosynthesis planning, essential in organic synthesis and drug discovery, has greatly benefited from recent AI-driven advancements. Nevertheless, existing methods frequently face limitations in both applicability and explainability. Traditional graph-based and sequence-to-sequence models often lack generalized chemical knowledge, leading to predictions that are neither consistently accurate nor easily explainable. To address these challenges, we introduce RetroDFM-R, a reasoning-based large language model (LLM) designed specifically for chemical retrosynthesis. Leveraging large-scale reinforcement learning guided by chemically verifiable rewards, RetroDFM-R significantly enhances prediction accuracy and explainability. Comprehensive evaluations demonstrate that RetroDFM-R significantly outperforms state-of-the-art methods, achieving a top-1 accuracy of 65.0% on the USPTO-50K benchmark. Double-blind human assessments further validate the chemical plausibility and practical utility of RetroDFM-R's predictions. RetroDFM-R also accurately predicts multistep retrosynthetic routes reported in the literature for both real-world drug molecules and perovskite materials. Crucially, the model's explicit reasoning process provides human-interpretable insights, thereby enhancing trust and practical value in real-world retrosynthesis applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Retrosynthesis | USPTO-50k Without Reaction Type | Top-1 Accuracy59 | 30 | |
| Chemical Reaction Prediction | USPTO50k | Accuracy (%)59 | 21 | |
| Retrosynthesis | USPTO Full | Top-1 Accuracy50.5 | 5 |