Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Discover Regulatory Elements for Gene Expression Prediction

About

We consider the problem of predicting gene expressions from DNA sequences. A key challenge of this task is to find the regulatory elements that control gene expressions. Here, we introduce Seq2Exp, a Sequence to Expression network explicitly designed to discover and extract regulatory elements that drive target gene expression, enhancing the accuracy of the gene expression prediction. Our approach captures the causal relationship between epigenomic signals, DNA sequences and their associated regulatory elements. Specifically, we propose to decompose the epigenomic signals and the DNA sequence conditioned on the causal active regulatory elements, and apply an information bottleneck with the Beta distribution to combine their effects while filtering out non-causal components. Our experiments demonstrate that Seq2Exp outperforms existing baselines in gene expression prediction tasks and discovers influential regions compared to commonly used statistical methods for peak detection such as MACS3. The source code is released as part of the AIRS library (https://github.com/divelab/AIRS/).

Xingyu Su, Haiyang Yu, Degui Zhi, Shuiwang Ji• 2025

Related benchmarks

TaskDatasetResultRank
Gene Expression CAGE PredictionK562
MSE0.1856
10
Gene Expression CAGE PredictionGM12878
MSE0.1873
10
gene expression predictionH1 cell line
MSE0.2784
3
Showing 3 of 3 rows

Other info

Follow for update