UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective
About
We propose a new paradigm for universal information extraction (IE) that is compatible with any schema format and applicable to a list of IE tasks, such as named entity recognition, relation extraction, event extraction and sentiment analysis. Our approach converts the text-based IE tasks as the token-pair problem, which uniformly disassembles all extraction targets into joint span detection, classification and association problems with a unified extractive framework, namely UniEX. UniEX can synchronously encode schema-based prompt and textual information, and collaboratively learn the generalized knowledge from pre-defined information using the auto-encoder language models. We develop a traffine attention mechanism to integrate heterogeneous factors including tasks, labels and inside tokens, and obtain the extraction target via a scoring matrix. Experiment results show that UniEX can outperform generative universal IE models in terms of performance and inference-speed on $14$ benchmarks IE datasets with the supervised setting. The state-of-the-art performance in low-resource scenarios also verifies the transferability and effectiveness of UniEX.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Named Entity Recognition | CoNLL 03 | F1 (Entity)92.65 | 102 | |
| Few-shot Named Entity Recognition | FewNERD Intra 1.0 | F1 Score63.26 | 44 | |
| Relation Extraction | CONLL04 | Relation Strict F173.4 | 43 | |
| Relation Extraction | SciERC | Relation Strict F138 | 28 | |
| Event extraction | ACE05 Evt | Event Trigger F174.08 | 26 | |
| Few-shot Named Entity Recognition | FewNERD Inter 1.0 | F1 Score73.79 | 20 | |
| Relation Extraction | ADE | Relation Strict F183.81 | 20 | |
| Sentiment Triplet Extraction | lap 14 | F1 Score65.23 | 17 | |
| Event extraction | CASIE | -- | 14 | |
| Relation Extraction | ACE Rel 05 | -- | 13 |