Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer

About

Nucleic acid-based drugs like aptamers have recently demonstrated great therapeutic potential. However, experimental platforms for aptamer screening are costly, and the scarcity of labeled data presents a challenge for supervised methods to learn protein-aptamer binding. To this end, we develop an unsupervised learning approach based on the predicted pairwise contact map between a protein and a nucleic acid and demonstrate its effectiveness in protein-aptamer binding prediction. Our model is based on FAFormer, a novel equivariant transformer architecture that seamlessly integrates frame averaging (FA) within each transformer block. This integration allows our model to infuse geometric information into node features while preserving the spatial semantics of coordinates, leading to greater expressive power than standard FA models. Our results show that FAFormer outperforms existing equivariant models in contact map prediction across three protein complex datasets, with over 10% relative improvement. Moreover, we curate five real-world protein-aptamer interaction datasets and show that the contact map predicted by FAFormer serves as a strong binding indicator for aptamer screening.

Tinglin Huang, Zhenqiao Song, Rex Ying, Wengong Jin• 2024

Related benchmarks

TaskDatasetResultRank
Aptamer ScreeningGFP
Top-10 Precision0.4
12
Aptamer ScreeningHNRNPC
Top-10 Precision33.33
10
Aptamer ScreeningNELF
Top-10 Precision23.33
9
Aptamer ScreeningUBLCP1
Top-10 Precision18
7
Contact Map PredictionProtein-RNA (test)
F1 Score12.84
7
Contact Map PredictionProtein-DNA (test)
F1 Score14.57
7
Contact Map PredictionProtein-Protein (test)
F1 Score15.96
7
Aptamer ScreeningCHK2
Top-10 Precision10
7
Binding site predictionProtein-DNA
F1 Score50.6
3
Binding site predictionProtein-RNA
F1 Score47.2
3
Showing 10 of 13 rows

Other info

Code

Follow for update