Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GESR: A Genetic Programming-Based Symbolic Regression Method with Gene Editing

About

Mathematical formulas serve as a language through which humans communicate with nature. Discovering mathematical laws from scientific data to describe natural phenomena has been a long-standing pursuit of humanity for centuries. In the field of artificial intelligence, this challenge is known as the symbolic regression problem. Among existing symbolic regression approaches, Genetic Programming (GP) based on evolutionary algorithms remains one of the most classical and widely adopted methods. GP simulates the evolutionary process across generations through genetic mutation and crossover. However, mutations and crossovers in GP are entirely random. While this randomness effectively mimics natural evolution, it inevitably produces both beneficial and detrimental variations. If there existed a metaphorical `God` capable of foreseeing which genetic mutations or crossovers would yield superior outcomes and performing targeted gene editing accordingly, the efficiency of evolution could be substantially improved. Motivated by this idea, we propose in this paper a symbolic regression approach based on gene editing, termed GESR. In GESR, we trained two "hands of God" (two BERT models). Among them, the first leverages the BERT's masked language modeling capability to guide the mutation of genes (expression symbols). The other BERT model guides the crossover of individual genes by predicting the crossover point. Experimental results demonstrate that GESR significantly improves computational efficiency compared with traditional GP algorithms and achieves strong overall performance across multiple symbolic regression tasks.

Yanjie Li, Liping Zhang, Min Wu, Weijun Li, Lina Yu, Jingyi Liu, Yusong Deng, Mingzhu Wan, Xin Ning• 2026

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionSRBench Feynman
R^2 Score0.9946
16
Symbolic RegressionSRBench Black-box
R^20.9364
16
Symbolic RegressionNguyen
R^299.99
15
Symbolic RegressionKeijzer
R^20.9998
15
Symbolic RegressionConstant
R^20.9997
15
Symbolic RegressionLivermore
R^299.62
15
Symbolic RegressionKorns
R^20.9964
15
Symbolic RegressionVladislavleva
R^299.92
15
Symbolic RegressionSRSD-Feynman Medium
NED54.5
12
Symbolic RegressionSRSD-Feynman Hard
NED0.602
12
Showing 10 of 31 rows

Other info

Follow for update