Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AutoPCR: Automated Phenotype Concept Recognition by Prompting

About

Motivation: Phenotype concept recognition (CR) is a fundamental task in biomedical text mining. However, existing methods either require ontology-specific training, making them struggle to generalize across diverse text styles and evolving biomedical terminology, or depend on general-purpose large language models (LLMs) that lack necessary domain knowledge. Results: To address these limitations, we propose AutoPCR, a prompt-based phenotype CR method designed to automatically generalize to new ontologies and unseen data without ontology-specific training. To further boost performance, we also introduce an optional self-supervised training strategy. Experiments show that AutoPCR achieves the best average and most robust performance across datasets. Further ablation and transfer studies demonstrate its inductive capability and generalizability to new ontologies. Availability and Implementation: Our code is available at https://github.com/yctao7/AutoPCR. Contact: drjieliu@umich.edu

Yicheng Tao, Yuanhao Huang, Yiqun Wang, Xin Luo, Jie Liu• 2025

Related benchmarks

TaskDatasetResultRank
Document-level phenotype concept recognitionBIOC-GS
Precision (P)84.57
12
Document-level phenotype concept recognitionAverage BIOC-GS GSC-2024 ID-68
Average F1 Score79.79
12
Phenotype Concept RecognitionBIOC-GS (test)
Precision (P)84.25
12
Document-level phenotype concept recognitionID-68
Precision88.77
12
Phenotype Concept RecognitionGSC 2024 (test)
Precision (P)89.08
12
Phenotype Concept RecognitionID-68 (test)
Precision87.84
12
Document-level phenotype concept recognitionGSC 2024
Precision91.62
12
Phenotype Concept RecognitionNCBI
mF173.15
5
Showing 8 of 8 rows

Other info

Follow for update