Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Nested Named Entity Recognition in Plasma Physics Research Articles

About

Named Entity Recognition (NER) is an important task in natural language processing that aims to identify and extract key entities from unstructured text. We present a novel application of NER in plasma physics research articles and address the challenges of extracting specialized entities from scientific text in this domain. Research articles in plasma physics often contain highly complex and context-rich content that must be extracted to enable, e.g., advanced search. We propose a lightweight approach based on encoder-transformers and conditional random fields to extract (nested) named entities from plasma physics research articles. First, we annotate a plasma physics corpus with 16 classes specifically designed for the nested NER task. Second, we evaluate an entity-specific model specialization approach, where independent BERT-CRF models are trained to recognize individual entity types in plasma physics text. Third, we integrate an optimization process to systematically fine-tune hyperparameters and enhance model performance. Our work contributes to the advancement of entity recognition in plasma physics and also provides a foundation to support researchers in navigating and analyzing scientific literature.

Muhammad Haris, Hans H\"oft, Markus M. Becker, Markus Stocker• 2026

Related benchmarks

TaskDatasetResultRank
Nested Named Entity RecognitionGENIA
F1 Score77
56
Nested Named Entity Recognitionplasma physics NNER dataset (test)
Precision64
11
Nested Named Entity RecognitionChilean Waiting List
Precision79
7
Showing 3 of 3 rows

Other info

Follow for update