
LLM-SRBENCH

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Symbolic Regression | LLM-SRBench (phys_osc) | Best Reward: 8.9985 | 16 |
| Symbolic Regression | LLM-SRBench matsci | Best Reward: 8.25 | 16 |
| Symbolic Regression | LLM-SRBench chem_react | Best Reward: 9 | 16 |
| Symbolic Regression | LLM-SRBench bio_pop_growth | Best Reward: 8.9725 | 16 |
| Symbolic Regression | LLM-SRBench Symbolic | Term Recall: 34.4 | 14 |
| Symbolic Regression | LLM-SRBench OOD (test) | NMSE: 0.325 | 14 |
| Symbolic Regression | LLM-SRBench ID (test) | NMSE: 0.4 | 14 |
| Symbolic Regression | LLM-SRBENCH LSR-Transform | NMSE: 0.067 | 13 |
| Symbolic Regression | LLM-SRBENCH LSR-Syn | Chemistry Error: 0 | 9 |