Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

About

Large Language Models (LLMs) combined with instruction tuning have made significant progress when generalizing to unseen tasks. However, they have been less successful in Information Extraction (IE), lagging behind task-specific models. Typically, IE tasks are characterized by complex annotation guidelines that describe the task and give examples to humans. Previous attempts to leverage such information have failed, even with the largest models, as they are not able to follow the guidelines out of the box. In this paper, we propose GoLLIE (Guideline-following Large Language Model for IE), a model able to improve zero-shot results on unseen IE tasks by virtue of being fine-tuned to comply with annotation guidelines. Comprehensive evaluation empirically demonstrates that GoLLIE is able to generalize to and follow unseen guidelines, outperforming previous attempts at zero-shot information extraction. The ablation study shows that detailed guidelines are key for good results.

Oscar Sainz, Iker Garc\'ia-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre• 2023

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionConll 2003
F1 Score92.9
86
Named Entity RecognitionWnut 2017--
79
Named Entity RecognitionOntoNotes 5--
44
Named Entity RecognitionACE05
F1 Score89.1
38
Named Entity RecognitionCrossNER
AI Score59.1
35
Named Entity RecognitionMIT
Movie Entity Score63
28
Named Entity RecognitionCross-domain NER datasets out-of-domain
AI NER Score59.1
23
Named Entity RecognitionmultiNERD
Entity F177.5
20
Named Entity RecognitionFabNER
Entity F126.3
17
Named Entity RecognitionHarveyNER
Entity F141.3
15
Showing 10 of 32 rows

Other info

Code

Follow for update