# GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer

## About
Named Entity Recognition (NER) is essential in various Natural Language Processing (NLP) applications. Traditional NER models are effective but limited to a set of predefined entity types. In contrast, Large Language Models (LLMs) can extract arbitrary entities through natural language instructions, offering greater flexibility. However, their size and cost, particularly for those accessed via APIs like ChatGPT, make them impractical in resource-limited scenarios. In this paper, we introduce a compact NER model trained to identify any type of entity. Leveraging a bidirectional transformer encoder, our model, GLiNER, facilitates parallel entity extraction, an advantage over the slow sequential token generation of LLMs. Through comprehensive testing, GLiNER demonstrates strong performance, outperforming both ChatGPT and fine-tuned LLMs in zero-shot evaluations on various NER benchmarks.
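The parallel-extraction idea above can be sketched in a few lines: every candidate span is scored against every entity-type representation in a single matrix multiplication, rather than being generated token by token as an LLM would. The sketch below is a toy illustration, not GLiNER's actual architecture; the embeddings, the mean-pooled span representation, and the `extract_entities` helper are all hypothetical stand-ins for what the bidirectional transformer encoder produces.

```python
import numpy as np

def extract_entities(token_embs, label_embs, labels, max_width=3, threshold=0.5):
    """Toy GLiNER-style span classifier.

    token_embs: (n_tokens, d) contextual token embeddings (hypothetical).
    label_embs: (n_labels, d) entity-type embeddings (hypothetical).
    Returns a list of (start, end, label, score) tuples.
    """
    n = token_embs.shape[0]
    spans, reps = [], []
    # Enumerate all candidate spans up to max_width tokens.
    for start in range(n):
        for end in range(start, min(start + max_width, n)):
            spans.append((start, end))
            # Toy span representation: mean of its token embeddings.
            reps.append(token_embs[start:end + 1].mean(axis=0))
    reps = np.stack(reps)                             # (n_spans, d)
    # One matmul scores every span against every label in parallel.
    scores = 1.0 / (1.0 + np.exp(-(reps @ label_embs.T)))  # sigmoid, (n_spans, n_labels)
    results = []
    for (s, e), row in zip(spans, scores):
        best = int(row.argmax())
        if row[best] > threshold:
            results.append((s, e, labels[best], float(row[best])))
    return results
```

Because the scoring step is a single dense operation over all spans and all entity types, adding a new entity type at inference time only means supplying one more label embedding, which is what makes the open-type, zero-shot setting tractable for a compact encoder.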
## Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Named Entity Recognition | CoNLL 03 | F1 (Entity) | 65.4 | 102 |
| Named Entity Recognition | OntoNotes | F1-score | 27.3 | 91 |
| Named Entity Recognition | CoNLL 2003 | F1 Score | 92.6 | 86 |
| Named Entity Recognition | BC5CDR | F1 Score | 68.7 | 59 |
| Named Entity Recognition | MIT Restaurant | -- | -- | 50 |
| Named Entity Recognition | ACE05 | F1 Score | 82.8 | 38 |
| Named Entity Recognition | GENIA | F1 Score | 55.1 | 37 |
| Named Entity Recognition | CrossNER | AI Score | 57.2 | 35 |
| Named Entity Recognition | MIT Movie | Entity Score | 57.2 | 28 |
| Named Entity Recognition | NCBI | F1 Score | 65.3 | 26 |