Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery

About

State of the art Symbolic Regression (SR) methods currently build specialized models, while the application of Large Language Models (LLMs) remains largely unexplored. In this work, we introduce the first comprehensive framework that utilizes LLMs for the task of SR. We propose In-Context Symbolic Regression (ICSR), an SR method which iteratively refines a functional form with an LLM and determines its coefficients with an external optimizer. ICSR leverages LLMs' strong mathematical prior both to propose an initial set of possible functions given the observations and to refine them based on their errors. Our findings reveal that LLMs are able to successfully find symbolic equations that fit the given data, matching or outperforming the overall performance of the best SR baselines on four popular benchmarks, while yielding simpler equations with better out of distribution generalization.

Matteo Merler, Katsiaryna Haitsiukevich, Nicola Dainese, Pekka Marttinen• 2024

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionLung Cancer with Chemo & Radio (LC-CR) (test)
MSE6.1
22
Symbolic RegressionWarfarin PK (Warf) (test)
MSE0.497
15
Symbolic RegressionCOVID-19 (C-19) (test)
MSE1.03e-7
15
Symbolic RegressionLung Cancer with Chemo (LC-C) (test)
MSE0.688
15
Symbolic RegressionLung Cancer (LC) (test)
MSE0.407
15
Symbolic RegressionLLM-SRBench ID (test)
NMSE1.15
14
Symbolic RegressionLLM-SRBench OOD (test)
NMSE0.87
14
Symbolic RegressionLLM-SRBench Symbolic
Term Recall13.1
14
Symbolic Discovery of Ordinary Differential EquationsSIR(2D) ID
Residual NMSE2.80e-8
10
Symbolic Discovery of Ordinary Differential EquationsSIR 2D ID-Ext
Residual NMSE4.62e-9
8
Showing 10 of 22 rows

Other info

Follow for update