Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Toward Closed-loop Molecular Discovery via Language Model, Property Alignment and Strategic Search

About

Drug discovery is a time-consuming and expensive process, with traditional high-throughput and docking-based virtual screening hampered by low success rates and limited scalability. Recent advances in generative modelling, including autoregressive, diffusion, and flow-based approaches, have enabled de novo ligand design beyond the limits of enumerative screening. Yet these models often suffer from inadequate generalization, limited interpretability, and an overemphasis on binding affinity at the expense of key pharmacological properties, thereby restricting their translational utility. Here we present Trio, a molecular generation framework integrating fragment-based molecular language modeling, reinforcement learning, and Monte Carlo tree search, for effective and interpretable closed-loop targeted molecular design. Through the three key components, Trio enables context-aware fragment assembly, enforces physicochemical and synthetic feasibility, and guides a balanced search between the exploration of novel chemotypes and the exploitation of promising intermediates within protein binding pockets. Experimental results show that Trio reliably achieves chemically valid and pharmacologically enhanced ligands, outperforming state-of-the-art approaches with improved binding affinity (+7.85%), drug-likeness (+11.10%) and synthetic accessibility (+12.05%), while expanding molecular diversity more than fourfold. By combining generalization, plausibility, and interpretability, Trio establishes a closed-loop generative paradigm that redefines how chemical space can be navigated, offering a transformative foundation for the next era of AI-driven drug discovery.

Junkai Ji, Zhangfan Yang, Dong Xu, Ruibin Bai, Jianqiang Li, Tingjun Hou, Zexuan Zhu• 2025

Related benchmarks

TaskDatasetResultRank
Molecular Dockingparp1
Mean Docking Score-13.129
18
Molecular Dockingfa7
Mean Docking Score-10.359
18
Molecular Docking5ht1b
Mean Docking Score-12.954
18
Molecular Dockingjak2
Mean Docking Score-11.806
18
Molecular Dockingbraf
Mean Docking Score-12.591
17
Showing 5 of 5 rows

Other info

Follow for update