Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NIMMGen: Learning Neural-Integrated Mechanistic Digital Twins with LLMs

About

Mechanistic models encode scientific knowledge about dynamical systems and are widely used in downstream scientific and policy applications. Recent work has explored LLM-based agentic frameworks to automatically construct mechanistic models from data; however, existing problem settings substantially oversimplify real-world conditions, leaving it unclear whether LLM-generated mechanistic models are reliable in practice. To address this gap, we introduce the Neural-Integrated Mechanistic Modeling (NIMM) evaluation framework, which evaluates LLM-generated mechanistic models under realistic settings with partial observations and diversified task objectives. Our evaluation reveals fundamental challenges in current baselines, ranging from model effectiveness to code-level correctness. Motivated by these findings, we design NIMMgen, an agentic framework for neural-integrated mechanistic modeling that enhances code correctness and practical validity through iterative refinement. Experiments across three datasets from diversified scientific domains demonstrate its strong performance. We also show that the learned mechanistic models support counterfactual intervention simulation.

Zihan Guan, Rituparna Datta, Mengxuan Hu, Shunshun Liu, Aiying Zhang, Prasanna Balachandran, Sheng Li, Anil Vullikanti• 2026

Related benchmarks

TaskDatasetResultRank
Spatial-temporal ForecastingCOVID-Bogota
RMSE333.9
9
Spatial-temporal ForecastingCOVID-Medellin
RMSE524.3
9
Spatial-temporal ForecastingInfluenza-USA
RMSE4.81
9
Spatial-temporal ForecastingMRSA-Virginia
RMSE39.83
9
Clinical health forecastingLung cancer
RMSE1.41
3
Clinical health forecastingLung Cancer w/ Chemo.
RMSE0.08
3
Clinical health forecastingLung Cancer w/ Chemo. & Radio.
RMSE0.06
3
Yield strength predictionFCC High-Entropy Alloys (HEAs) room temperature
RMSE139.2
2
Yield strength predictionBCC High-Entropy Alloys (HEAs) temperature-dependent
RMSE180.1
2
Showing 9 of 9 rows

Other info

Follow for update