Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantic NLP Pipelines for Interoperable Patient Digital Twins from Unstructured EHRs

About

Digital twins -- virtual replicas of physical entities -- are gaining traction in healthcare for personalized monitoring, predictive modeling, and clinical decision support. However, generating interoperable patient digital twins from unstructured electronic health records (EHRs) remains challenging due to variability in clinical documentation and lack of standardized mappings. This paper presents a semantic NLP-driven pipeline that transforms free-text EHR notes into FHIR-compliant digital twin representations. The pipeline leverages named entity recognition (NER) to extract clinical concepts, concept normalization to map entities to SNOMED-CT or ICD-10, and relation extraction to capture structured associations between conditions, medications, and observations. Evaluation on MIMIC-IV Clinical Database Demo with validation against MIMIC-IV-on-FHIR reference mappings demonstrates high F1-scores for entity and relation extraction, with improved schema completeness and interoperability compared to baseline methods.

Rafael Brens, Yuqiao Meng, Luoxi Tang, Zhaohan Xi• 2026

Related benchmarks

TaskDatasetResultRank
FHIR Resource AssemblyMIMIC-IV Demo v2.2 (test)
Semantic Completeness0.91
3
Named Entity RecognitionMIMIC-IV Demo v2.2 (test)
NER F1 Score89
3
Relation ExtractionMIMIC-IV Demo v2.2 (test)
RE F181
3
Showing 3 of 3 rows

Other info

Follow for update