Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

deCIFer: Crystal Structure Prediction from Powder Diffraction Data using Autoregressive Language Models

About

Novel materials drive advancements in fields ranging from energy storage to electronics, with crystal structure characterization forming a crucial yet challenging step in materials discovery. In this work, we introduce \emph{deCIFer}, an autoregressive language model designed for powder X-ray diffraction (PXRD)-conditioned crystal structure prediction (PXRD-CSP). Unlike traditional CSP methods that rely primarily on composition or symmetry constraints, deCIFer explicitly incorporates PXRD data, directly generating crystal structures in the widely adopted Crystallographic Information File (CIF) format. The model is trained on nearly 2.3 million crystal structures, with PXRD conditioning augmented by basic forms of synthetic experimental artifacts, specifically Gaussian noise and instrumental peak broadening, to reflect fundamental real-world conditions. Validated across diverse synthetic datasets representative of challenging inorganic materials, deCIFer achieves a 94\% structural match rate. The evaluation is based on metrics such as the residual weighted profile ($R_{wp}$) and structural match rate (MR), chosen explicitly for their practical relevance in this inherently underdetermined problem. deCIFer establishes a robust baseline for future expansion toward more complex experimental scenarios, bridging the gap between computational predictions and experimental crystal structure determination.

Frederik Lizak Johansen, Ulrik Friis-Jensen, Erik Bj{\o}rnager Dam, Kirsten Marie {\O}rnsbjerg Jensen, Roc\'io Mercado, Raghavendra Selvan• 2025

Related benchmarks

TaskDatasetResultRank
Stable structure predictionCarbon-24
Match Rate37.96
21
Crystal Structure PredictionMP-20
Match Rate (%)44.65
13
Crystal Structure PredictionMPTS-52
Match Rate (MR)11.6
13
Crystal Structure GenerationPerov-5
Match Rate (%)85.59
12
Showing 4 of 4 rows

Other info

Follow for update