Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Neural Symbolic Regression that Scales

About

Symbolic equations are at the core of scientific discovery. The task of discovering the underlying equation from a set of input-output pairs is called symbolic regression. Traditionally, symbolic regression methods use hand-designed strategies that do not improve with experience. In this paper, we introduce the first symbolic regression method that leverages large scale pre-training. We procedurally generate an unbounded set of equations, and simultaneously pre-train a Transformer to predict the symbolic equation from a corresponding set of input-output-pairs. At test time, we query the model on a new set of points and use its output to guide the search for the equation. We show empirically that this approach can re-discover a set of well-known physical equations, and that it improves over time with more data and compute.

Luca Biggio, Tommaso Bendinelli, Alexander Neitz, Aurelien Lucchi, Giambattista Parascandolo• 2021

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionSRBench black-box (test)
R^20.123
71
Symbolic RegressionSRBench Strogatz (test)
Mean Test R^20.521
59
Symbolic RegressionSRBench Feynman (test)
Mean Test R^239.6
57
Symbolic RegressionOscillator 1 (OOD)
NMSE0.542
32
Symbolic RegressionOscillation 1 LLM-SR Suite
NMSE0.004
30
Symbolic RegressionDGSR benchmark
Recall100
22
Symbolic RegressionLSR-Synth
Overall Acc (Tol 0.01)3.1
22
Symbolic RegressionStrogatz Dataset epsilon=0.001 (test)
R2 Score0.5219
20
Symbolic RegressionStrogatz Dataset epsilon=0.01 (test)
R2 Score0.5179
20
Symbolic RegressionStrogatz Dataset epsilon=0.1 (test)
R250.54
20
Showing 10 of 53 rows

Other info

Follow for update