Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Span Model for Open Information Extraction on Accurate Corpus

About

Open information extraction (Open IE) is a challenging task especially due to its brittle data basis. Most of Open IE systems have to be trained on automatically built corpus and evaluated on inaccurate test set. In this work, we first alleviate this difficulty from both sides of training and test sets. For the former, we propose an improved model design to more sufficiently exploit training dataset. For the latter, we present our accurately re-annotated benchmark test set (Re-OIE6) according to a series of linguistic observation and analysis. Then, we introduce a span model instead of previous adopted sequence labeling formulization for n-ary Open IE. Our newly introduced model achieves new state-of-the-art performance on both benchmark evaluation datasets.

Junlang Zhan, Hai Zhao• 2019

Related benchmarks

TaskDatasetResultRank
Open Information ExtractionCaRB (test)
F1 Score50
53
Open Information ExtractionOIE 2016 (test)
F169.42
32
Open Information ExtractionRe-OIE 2016 (test)
AUC68
20
Open Information ExtractionReOIE (test)
F1 Score77
13
Open Information ExtractionCaRB standard (test)
F1 Score48.5
12
Open Information ExtractionWire57-C standard (test)
F1 Score31.9
12
Open Information ExtractionCaRB 1-1 one-to-one mapping variant (test)
F1 Score32.9
12
Open Information ExtractionCaRB-nary English
F1 Score49.4
10
Showing 8 of 8 rows

Other info

Code

Follow for update