Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity

About

We present an improved method for symbolic regression that seeks to fit data to formulas that are Pareto-optimal, in the sense of having the best accuracy for a given complexity. It improves on the previous state-of-the-art by typically being orders of magnitude more robust toward noise and bad data, and also by discovering many formulas that stumped previous methods. We develop a method for discovering generalized symmetries (arbitrary modularity in the computational graph of a formula) from gradient properties of a neural network fit. We use normalizing flows to generalize our symbolic regression method to probability distributions from which we only have samples, and employ statistical hypothesis testing to accelerate robust brute-force search.

Silviu-Marian Udrescu, Andrew Tan, Jiahai Feng, Orisvaldo Neto, Tailin Wu, Max Tegmark• 2020

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionSRBench black-box (test)
R^20.211
28
Symbolic RegressionFeynman Dataset ϵ = 0.0 (test)
R^20.9314
20
Symbolic RegressionFeynman Dataset epsilon=0.001 (test)
R291.77
20
Symbolic RegressionFeynman Dataset epsilon=0.01 (test)
R20.8732
20
Symbolic RegressionStrogatz Dataset epsilon=0.01 (test)
R2 Score0.7753
20
Symbolic RegressionStrogatz Dataset epsilon=0.001 (test)
R2 Score0.6855
20
Symbolic RegressionStrogatz Dataset ϵ = 0.0 (test)
R^20.6459
20
Symbolic RegressionStrogatz Dataset epsilon=0.1 (test)
R231.7
20
Symbolic RegressionFeynman Dataset epsilon=0.1 (test)
R2 Score0.2248
20
Showing 9 of 9 rows

Other info

Follow for update