Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SPECTRA: Spectral Domain-Aware Graph Generation for Imbalanced Molecular Property Regression

About

Molecular property regression struggles with cases in chemically relevant target ranges that are underrepresented in datasets. Standard average error minimization approaches underperform in these highly relevant cases, and oversampling approaches lead to meaningless molecular representations. In this paper, we propose SPECTRA, a spectral, domain-aware graph generation method designed to improve the prediction of underrepresented but relevant molecular property values. It combines a rarity-aware budgeting scheme to focus generation where data are scarce, target-neighbors graph alignment to establish structural correspondence, and interpolation of Laplacian spectra, node features, and targets. Coupled with spectral GNN using edge-aware Chebyshev convolutions, SPECTRA shows its effectiveness in property prediction benchmarks with competitive performance over leading state-of-the-art methods in relevant target ranges, while requiring ~4x less computational time.

Brenda Nogueira, Gisela A. Gonzalez-Montiel, Meng Jiang, Nitesh V. Chawla, Nuno Moniz• 2025

Related benchmarks

TaskDatasetResultRank
Molecular Property RegressionFreeSolv
MAE0.77
7
Molecular Property RegressionLipo
MAE0.38
7
Molecular Property RegressionQM9
MAE0.0036
7
Molecular Property RegressionESOL
MAE0.53
7
Showing 4 of 4 rows

Other info

Follow for update