Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CODER: Knowledge infused cross-lingual medical term embedding for term normalization

About

This paper proposes CODER: contrastive learning on knowledge graphs for cross-lingual medical term representation. CODER is designed for medical term normalization by providing close vector representations for different terms that represent the same or similar medical concepts with cross-lingual support. We train CODER via contrastive learning on a medical knowledge graph (KG) named the Unified Medical Language System, where similarities are calculated utilizing both terms and relation triplets from KG. Training with relations injects medical knowledge into embeddings and aims to provide potentially better machine learning features. We evaluate CODER in zero-shot term normalization, semantic similarity, and relation classification benchmarks, which show that CODERoutperforms various state-of-the-art biomedical word embedding, concept embeddings, and contextual embeddings. Our codes and models are available at https://github.com/GanjinZero/CODER.

Zheng Yuan, Zhengyun Zhao, Haixia Sun, Jiao Li, Fei Wang, Sheng Yu• 2020

Related benchmarks

TaskDatasetResultRank
Feature Selection AlignmentExpert-labeled disease feature relevance dataset
T1D Alignment Score80.3
26
Concept Similarity DetectionMulti-institutional EHR dataset
AUC96.9
25
Relatedness DetectionMulti-institutional EHR dataset
AUC0.831
25
Clinical Similarity DetectionGeneral Clinical Relation Pairs
AUC0.876
25
Feature selection evaluationGPT-4 Feature Relevance Estimation Suite Silver Standard (test)
T1D Score64.4
25
Clinical Relatedness DetectionGeneral Clinical Relation Pairs
AUC65.5
25
Cross-institutional code mappingUPMC LAB-LOINC
Spearman's Rank Correlation0.554
24
Cross-institutional code mappingUPMC PX-CCS
Spearman's Correlation0.418
24
Cross-institutional code mappingBDX CCAM-CCS
Spearman Correlation0.54
24
Medical Code MappingVA local laboratory codes to LOINC/LP
Top-1 Accuracy55.6
21
Showing 10 of 25 rows

Other info

Follow for update