Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction

About

We introduce a multi-task setup of identifying and classifying entities, relations, and coreference clusters in scientific articles. We create SciERC, a dataset that includes annotations for all three tasks and develop a unified framework called Scientific Information Extractor (SciIE) for with shared span representations. The multi-task setup reduces cascading errors between tasks and leverages cross-sentence relations through coreference links. Experiments show that our multi-task model outperforms previous models in scientific information extraction without using any domain-specific features. We further show that the framework supports construction of a scientific knowledge graph, which we use to analyze information in scientific literature.

Yi Luan, Luheng He, Mari Ostendorf, Hannaneh Hajishirzi• 2018

Related benchmarks

TaskDatasetResultRank
Relation ExtractionSCIERC (test)
F1 Score39.3
23
Entity recognitionSCIERC (test)
F1 Score64.2
20
Keyphrase ExtractionSemEval Task 10 ScienceIE 2017 (test)
F1 Score46
15
Coreference ResolutionSCIERC (test)
Precision52
7
Entity recognitionSCIERC (dev)
Precision70
6
Coreference ResolutionSTM corpus five-fold cross validation (test)
MUC Precision60.3
6
Relation ExtractionSCIERC (dev)
Precision45.4
4
TDM Triple ExtractionNLP-TDMS excluding papers with 'Unknown' annotation (test)
Macro Precision24.9
4
Task + Dataset + Metric ExtractionNLP-TDMS (test)
Macro Precision0.181
4
Span IdentificationSemEval ScienceIE Task 10 2017 (test)
F1 Score58.6
3
Showing 10 of 13 rows

Other info

Code

Follow for update