Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

IMoJIE: Iterative Memory-Based Joint Open Information Extraction

About

While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task. Our work builds upon CopyAttention, a sequence generation OpenIE model (Cui et. al., 2018). Our analysis reveals that CopyAttention produces a constant number of extractions per sentence, and its extracted tuples often express redundant information. We present IMoJIE, an extension to CopyAttention, which produces the next extraction conditioned on all previously extracted tuples. This approach overcomes both shortcomings of CopyAttention, resulting in a variable number of diverse extractions per sentence. We train IMoJIE on training data bootstrapped from extractions of several non-neural systems, which have been automatically filtered to reduce redundancy and noise. IMoJIE outperforms CopyAttention by about 18 F1 pts, and a BERT-based strong baseline by 2 F1 pts, establishing a new state of the art for the task.

Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, Mausam, Soumen Chakrabarti• 2020

Related benchmarks

TaskDatasetResultRank
Open Information ExtractionCaRB (test)
F1 Score56.8
53
Open Information ExtractionOIE 2016 (test)
F156.8
32
Open Information ExtractionReOIE (test)
F1 Score76.2
13
Open Information ExtractionCaRB standard (test)
F1 Score53.5
12
Open Information ExtractionWire57-C standard (test)
F1 Score36
12
Open Information ExtractionCaRB 1-1 one-to-one mapping variant (test)
F1 Score41.4
12
Open Information ExtractionLSOIE sci (test)
F1 Score58.75
12
Open Information ExtractionLSOIE wiki (test)
F1 Score49.24
12
Open Information ExtractionCaRB Manual Subset (100 sentences) (val)
Precision90
6
Open Information ExtractionWire57 (test)
Precision (P)41.2
6
Showing 10 of 13 rows

Other info

Follow for update