Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Neural Symbolic Regression that Scales

About

Symbolic equations are at the core of scientific discovery. The task of discovering the underlying equation from a set of input-output pairs is called symbolic regression. Traditionally, symbolic regression methods use hand-designed strategies that do not improve with experience. In this paper, we introduce the first symbolic regression method that leverages large scale pre-training. We procedurally generate an unbounded set of equations, and simultaneously pre-train a Transformer to predict the symbolic equation from a corresponding set of input-output-pairs. At test time, we query the model on a new set of points and use its output to guide the search for the equation. We show empirically that this approach can re-discover a set of well-known physical equations, and that it improves over time with more data and compute.

Luca Biggio, Tommaso Bendinelli, Alexander Neitz, Aurelien Lucchi, Giambattista Parascandolo• 2021

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionSRBench black-box (test)
R^20.1228
53
Symbolic RegressionOscillation 1 LLM-SR Suite
NMSE0.004
30
Symbolic RegressionDGSR benchmark
Recall100
22
Symbolic RegressionLSR-Synth
Overall Acc (Tol 0.01)3.1
22
Symbolic RegressionStrogatz Dataset epsilon=0.001 (test)
R2 Score0.5219
20
Symbolic RegressionStrogatz Dataset epsilon=0.01 (test)
R2 Score0.5179
20
Symbolic RegressionStrogatz Dataset epsilon=0.1 (test)
R250.54
20
Symbolic RegressionFeynman Dataset epsilon=0.1 (test)
R2 Score0.3823
20
Symbolic RegressionFeynman Dataset epsilon=0.001 (test)
R239.79
20
Symbolic RegressionFeynman Dataset epsilon=0.01 (test)
R20.3942
20
Showing 10 of 38 rows

Other info

Follow for update