Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Neural Symbolic Model for Space Physics

About

In this study, we unveil a new AI model, termed PhyE2E, to discover physical formulas through symbolic regression. PhyE2E simplifies symbolic regression by decomposing it into sub-problems using the second-order derivatives of an oracle neural network, and employs a transformer model to translate data into symbolic formulas in an end-to-end manner. The resulting formulas are refined through Monte-Carlo Tree Search and Genetic Programming. We leverage a large language model to synthesize extensive symbolic expressions resembling real physics, and train the model to recover these formulas directly from data. A comprehensive evaluation reveals that PhyE2E outperforms existing state-of-the-art approaches, delivering superior symbolic accuracy, precision in data fitting, and consistency in physical units. We deployed PhyE2E to five applications in space physics, including the prediction of sunspot numbers, solar rotational angular velocity, emission line contribution functions, near-Earth plasma pressure, and lunar-tide plasma signals. The physical formulas generated by AI demonstrate a high degree of accuracy in fitting the experimental data from satellites and astronomical telescopes. We have successfully upgraded the formula proposed by NASA in 1993 regarding solar activity, and for the first time, provided the explanations for the long cycle of solar activity in an explicit form. We also found that the decay of near-Earth plasma pressure is proportional to r^2 to Earth, where subsequent mathematical derivations are consistent with satellite data from another independent study. Moreover, we found physical formulas that can describe the relationships between emission lines in the extreme ultraviolet spectrum of the Sun, temperatures, electron densities, and magnetic fields. The formula obtained is consistent with the properties that physicists had previously hypothesized it should possess.

Jie Ying, Haowei Lin, Chao Yue, Yajie Chen, Chao Xiao, Quanqi Shi, Yitao Liang, Shing-Tung Yau, Yuan Zhou, Jianzhu Ma• 2025

Related benchmarks

TaskDatasetResultRank
Symbolic RegressionE. coli growth LLM-SR Suite
NMSE0.214
44
Symbolic RegressionOscillation 1 LLM-SR Suite
NMSE0.0059
30
Symbolic RegressionStress–Strain (ID)
NMSE0.0321
18
Symbolic RegressionStress–Strain (OOD)
NMSE0.147
18
Symbolic RegressionOscillator 2 (ID)
NMSE0.0993
18
Symbolic RegressionOscillator 1 (OOD)
NMSE0.0165
18
Symbolic RegressionCRK (ID)
NMSE1.26e-5
18
Symbolic RegressionCRK (OOD)
NMSE0.0018
18
Symbolic RegressionOscillator 2 (OOD)
NMSE0.636
18
Showing 9 of 9 rows

Other info

Follow for update