MSP-LLM: A Unified Large Language Model Framework for Complete Material Synthesis Planning
About
Material synthesis planning (MSP) remains a fundamental and underexplored bottleneck in AI-driven materials discovery, as it requires not only identifying suitable precursor materials but also designing coherent sequences of synthesis operations to realize a target material. Although several AI-based approaches have been proposed to address isolated subtasks of MSP, a unified methodology for solving the entire MSP task has yet to be established. We propose MSP-LLM, a unified LLM-based framework that formulates MSP as a structured process composed of two constituent subproblems: precursor prediction (PP) and synthesis operation prediction (SOP). Our approach introduces a discrete material class as an intermediate decision variable that organizes both tasks into a chemically consistent decision chain. For SOP, we further incorporate hierarchical precursor types as synthesis-relevant inductive biases and employ an explicit conditioning strategy that preserves precursor-related information in the autoregressive decoding state. Extensive experiments show that MSP-LLM consistently outperforms existing methods on both PP and SOP, as well as on the complete MSP task, demonstrating an effective and scalable framework for MSP that can accelerate real-world materials discovery.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Precursor Prediction | Inorganic material synthesis dataset (Split 1) | Top-1 Accuracy71.02 | 12 | |
| Precursor Prediction | Inorganic material synthesis dataset (Split 2) | Top-1 Acc72.85 | 12 | |
| Synthesis Operation Prediction | Material Synthesis Dataset (Split 1) | Top-1 Accuracy7.86 | 12 | |
| Synthesis Operation Prediction | Material Synthesis Dataset (Split 2) | Top-1 Accuracy8.77 | 12 | |
| Synthesis Operation Prediction | Inorganic Material Synthesis (Split 1) | NED0.7174 | 12 | |
| Synthesis Operation Prediction | Inorganic Material Synthesis (Split 2) | NED0.742 | 12 | |
| Material Synthesis Planning | MSP Dataset (Split 2) | Top-1 Accuracy6.72 | 11 | |
| Material Synthesis Planning | MSP Dataset (Split 1) | Top-1 Accuracy5.76 | 11 |