Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Easy-to-Hard Learning for Information Extraction

About

Information extraction (IE) systems aim to automatically extract structured information, such as named entities, relations between entities, and events, from unstructured texts. While most existing work addresses a particular IE task, universally modeling various IE tasks with one model has achieved great success recently. Despite their success, they employ a one-stage learning strategy, i.e., directly learning to extract the target structure given the input text, which contradicts the human learning process. In this paper, we propose a unified easy-to-hard learning framework consisting of three stages, i.e., the easy stage, the hard stage, and the main stage, for IE by mimicking the human learning process. By breaking down the learning process into multiple stages, our framework facilitates the model to acquire general IE task knowledge and improve its generalization ability. Extensive experiments across four IE tasks demonstrate the effectiveness of our framework. We achieve new state-of-the-art results on 13 out of 17 datasets. Our code is available at \url{https://github.com/DAMO-NLP-SG/IE-E2H}.

Chang Gao, Wenxuan Zhang, Wai Lam, Lidong Bing• 2023

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL 03
F1 (Entity)92.43
102
Named Entity RecognitionACE05
F1 Score86.25
38
Relation ExtractionCoNLL04 (test)
F1 Score75.31
28
Relation ExtractionSciERC
Relation Strict F139
28
Event extractionACE05 Evt
Event Trigger F173.5
26
Event extractionCASIE--
14
Relation ExtractionACE Rel 05
F1 Score66.21
13
Named Entity RecognitionACE 2004
Entity F187.06
12
aspect sentiment triplet extractionASTE benchmark suite Rest14, Laptop14, Rest15, Rest16
Rest14 Score75.92
7
Event extractionACE05-E
Trigger F172.19
7
Showing 10 of 12 rows

Other info

Code

Follow for update