drGT: Attention-Guided Gene Assessment of Drug Response Utilizing a Drug-Cell-Gene Heterogeneous Network

About

For translational impact, both accurate drug response prediction and biological plausibility of predictive features are needed. We present drGT, a heterogeneous graph deep learning model over drugs, genes, and cell lines that couples prediction with mechanism-oriented interpretability via attention coefficients (ACs). We assess both predictive generalization (random, unseen-drug, unseen-cell, and zero-shot splits) and biological plausibility (use of text-mined PubMed gene-drug co-mentions and comparison to a structure-based DTI predictor) on GDSC, NCI60, and CTRP datasets. Across benchmarks, drGT consistently delivers top regression performance while maintaining competitive classification accuracy for drug sensitivity. Under random 5-fold cross-validation, drGT attains an AUROC of up to 0.945 (3rd overall) and an $R^2$ up to 0.690, outperforming all baselines on regression. In leave-one-out tests for unseen cell lines and drugs, drGT achieves AUROCs of 0.706 and 0.844, and $R^2$ values of 0.692 and 0.022, the only model yielding positive $R^2$ for unseen drugs. In zero-shot prediction, drGT achieves an AUROC of 0.786 and a regression $R^2$ of 0.334, both representing the highest scores among all models. For interpretability, AC-derived drug-gene links recover known biology: among 976 drugs with known DTIs, 36.9% of predicted links match established DTIs, and 63.7% are supported by either PubMed abstracts or a structure-based predictive model. Enrichment analyses of AC-prioritized genes reveal drug-perturbed biological processes, providing pathway-level explanations. drGT advances predictive generalization and mechanism-centered interpretability, offering state-of-the-art regression accuracy and literature-supported biological hypotheses that demonstrate the use of graph learning from heterogeneous input data for biological discovery. Code: https://github.com/sciluna/drGT

Yoshitaka Inoue, Hunmin Lee, Tianfan Fu, Rui Kuang, Augustin Luna• 2024

Related benchmarks

Task	Dataset	Result
Regression	DS2	R-Squared0.544	16
Classification	GDSC2 unseen drugs (Drug)	AUROC84.4	14
Drug response prediction	GDSC2 Seen pairs v1.0	R2 Score0.354	7
Drug response prediction	GDSC2 Unseen pairs (n=4,219) v1.0	R2 Score0.261	7
Drug response prediction	GDSC2 v1.0	R2 Score0.334	7
Drug-cell response classification	GDSC2 Overall	AUROC78.6	7
Regression	NCI60	R^20.69	7
Regression	CTRP	R^20.58	7
Regression	GDSC1	R^20.475	7
Regression	GDSC2 (unseen cell lines)	R^20.692	7

Showing 10 of 17 rows

Other info

Follow for update

@wizwand_team Discord