Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GAUCHE: A Library for Gaussian Processes in Chemistry

About

We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings and bit vectors. By defining such kernels in GAUCHE, we seek to open the door to powerful tools for uncertainty quantification and Bayesian optimisation in chemistry. Motivated by scenarios frequently encountered in experimental chemistry, we showcase applications for GAUCHE in molecular discovery and chemical reaction optimisation. The codebase is made available at https://github.com/leojklarner/gauche

Ryan-Rhys Griffiths, Leo Klarner, Henry B. Moss, Aditya Ravuri, Sang Truong, Samuel Stanton, Gary Tom, Bojana Rankovic, Yuanqi Du, Arian Jamasb, Aryan Deshwal, Julius Schwartz, Austin Tripp, Gregory Kell, Simon Frieder, Anthony Bourached, Alex Chan, Jacob Moss, Chengzhi Guo, Johannes Durholt, Saudamini Chaurasia, Felix Strieth-Kalthoff, Alpha A. Lee, Bingqing Cheng, Al\'an Aspuru-Guzik, Philippe Schwaller, Jian Tang• 2022

Related benchmarks

TaskDatasetResultRank
Virtual Screening100 Proteins
Median Percent Lift (Avg Top-10)31
45
Protein hit discoveryChEMBL Proteins
Win % (Avg Top-10, PIC 7.0)9
20
Ligand discovery100 Proteins Min Top-3 endpoint
MLT (Target PIC 7.0)13
11
Ligand discovery100 Proteins Average Top-10 endpoint
MLT Score (PIC 7.0)18
11
Ligand scoring1,000,000 ligands
Inference Time (s)39
4
Showing 5 of 5 rows

Other info

Follow for update