Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Neural Symbolic Regression that Scales

About

Symbolic equations are at the core of scientific discovery. The task of discovering the underlying equation from a set of input-output pairs is called symbolic regression. Traditionally, symbolic regression methods use hand-designed strategies that do not improve with experience. In this paper, we introduce the first symbolic regression method that leverages large scale pre-training. We procedurally generate an unbounded set of equations, and simultaneously pre-train a Transformer to predict the symbolic equation from a corresponding set of input-output-pairs. At test time, we query the model on a new set of points and use its output to guide the search for the equation. We show empirically that this approach can re-discover a set of well-known physical equations, and that it improves over time with more data and compute.

Luca Biggio, Tommaso Bendinelli, Alexander Neitz, Aurelien Lucchi, Giambattista Parascandolo• 2021

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionOscillation 1 LLM-SR Suite
NMSE0.004
30
Symbolic RegressionSRBench black-box (test)
R^20.1228
28
Symbolic RegressionDGSR benchmark
Recall100
22
Symbolic RegressionLSR-Synth
Overall Acc (Tol 0.01)3.1
22
Symbolic RegressionStrogatz Dataset epsilon=0.001 (test)
R2 Score0.5219
20
Symbolic RegressionStrogatz Dataset epsilon=0.01 (test)
R2 Score0.5179
20
Symbolic RegressionStrogatz Dataset epsilon=0.1 (test)
R250.54
20
Symbolic RegressionFeynman Dataset epsilon=0.1 (test)
R2 Score0.3823
20
Symbolic RegressionFeynman Dataset epsilon=0.001 (test)
R239.79
20
Symbolic RegressionFeynman Dataset epsilon=0.01 (test)
R20.3942
20
Showing 10 of 27 rows

Other info

Follow for update