Prior-Guided Symbolic Regression: Towards Scientific Consistency in Equation Discovery

About

Symbolic Regression (SR) aims to discover interpretable equations from observational data, with the potential to reveal underlying principles behind natural phenomena. However, existing approaches often fall into the Pseudo-Equation Trap: producing equations that fit observations well but remain inconsistent with fundamental scientific principles. A key reason is that these approaches are dominated by empirical risk minimization, lacking explicit constraints to ensure scientific consistency. To bridge this gap, we propose PG-SR, a prior-guided SR framework built upon a three-stage pipeline consisting of warm-up, evolution, and refinement. Throughout the pipeline, PG-SR introduces a prior constraint checker that explicitly encodes domain priors as executable constraint programs, and employs a Prior Annealing Constrained Evaluation (PACE) mechanism during the evolution stage to progressively steer discovery toward scientifically consistent regions. Theoretically, we prove that PG-SR reduces the Rademacher complexity of the hypothesis space, yielding tighter generalization bounds and establishing a guarantee against pseudo-equations. Experimentally, PG-SR outperforms state-of-the-art baselines across diverse domains, maintaining robustness to varying prior quality, noisy data, and data scarcity.
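The abstract describes two mechanisms: domain priors encoded as executable constraint programs, and an annealed evaluation that steers the search toward consistent equations. The sketch below is one possible reading of that idea, not the authors' implementation. The monotonicity prior, the linear annealing schedule, the additive penalty, and every function name are assumptions made for illustration.

```python
import numpy as np

def mse(y_true, y_pred):
    """Plain mean squared error, used here as the data-fit term."""
    return float(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))

def monotone_increasing_violation(f, x_grid):
    """Hypothetical prior check: f should be non-decreasing on x_grid.
    Returns the fraction of adjacent grid steps where f decreases."""
    y = f(x_grid)
    return float(np.mean(np.diff(y) < 0))

def annealed_weight(generation, total_generations, max_weight=1.0):
    """Assumed linear schedule: the penalty weight grows from 0 to max_weight."""
    return max_weight * generation / max(1, total_generations - 1)

def annealed_score(f, x, y, x_grid, generation, total_generations):
    """Data-fit error plus a prior-violation penalty whose weight rises over time."""
    fit = mse(y, f(x))
    violation = monotone_increasing_violation(f, x_grid)
    return fit + annealed_weight(generation, total_generations) * violation

# Illustrative data: the target is increasing, the candidate is decreasing.
x = np.linspace(0.0, 1.0, 50)
y = x.copy()
bad_candidate = lambda t: -t

early = annealed_score(bad_candidate, x, y, x, generation=0, total_generations=10)
late = annealed_score(bad_candidate, x, y, x, generation=9, total_generations=10)
```

In this toy setup the prior-violating candidate is scored on fit alone in the first generation and carries the full penalty in the last, so early exploration is permissive while late selection favors candidates that satisfy the prior.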

Jing Xiao, Xinhai Chen, Jiaming Peng, Qinglin Wang, Menghan Jia, Zhiquan Lai, Guangping Yu, Dongsheng Li, Tiejun Li, Jie Liu • 2026

Related benchmarks

Task                 Dataset                       Result          Rank
Symbolic Regression  E. coli growth LLM-SR Suite   NMSE 0.0011     44
Symbolic Regression  Oscillation 1 LLM-SR Suite    NMSE 6.62e-14   30
Symbolic Regression  Stress–Strain (ID)            NMSE 0.0061     18
Symbolic Regression  Stress–Strain (OOD)           NMSE 0.0321     18
Symbolic Regression  CRK (ID)                      NMSE 1.74e-11   18
Symbolic Regression  CRK (OOD)                     NMSE 1.22e-8    18
Symbolic Regression  Oscillator 1 (OOD)            NMSE 2.95e-8    18
Symbolic Regression  Oscillator 2 (ID)             NMSE 1.79e-11   18
Symbolic Regression  Oscillator 2 (OOD)            NMSE 8.32e-11   18
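
The results above are reported as NMSE on in-distribution (ID) and out-of-distribution (OOD) splits. As a rough guide to reading them, the sketch below computes NMSE under one common definition, mean squared error divided by the variance of the targets. The page does not state which normalization the benchmark uses, so that choice is an assumption, and the data and candidate equation are invented for illustration.

```python
import numpy as np

def nmse(y_true, y_pred):
    """Normalized MSE (assumed definition): MSE divided by the variance of y_true."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean((y_true - y_pred) ** 2) / np.var(y_true))

# Illustrative only: the true law is exp(x); the candidate is a truncated
# Taylor series that fits well near 0 but drifts outside the training range.
truth = lambda x: np.exp(x)
candidate = lambda x: 1.0 + x + x ** 2 / 2.0

x_id = np.linspace(0.0, 1.0, 100)    # range covered by the training data
x_ood = np.linspace(1.0, 2.0, 100)   # range outside the training data

nmse_id = nmse(truth(x_id), candidate(x_id))
nmse_ood = nmse(truth(x_ood), candidate(x_ood))
print(f"ID NMSE:  {nmse_id:.4f}")
print(f"OOD NMSE: {nmse_ood:.4f}")
```

Under this definition a perfect fit scores 0 and always predicting the mean of the targets scores 1, so values near 1e-11 indicate a near-exact recovery of the underlying equation on that split.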
