Improved Relation Extraction with Feature-Rich Compositional Embedding Models
About
Compositional embedding models build a representation (or embedding) for a linguistic structure based on its component word embeddings. We propose a Feature-rich Compositional Embedding Model (FCM) for relation extraction that is expressive, generalizes to new domains, and is easy-to-implement. The key idea is to combine both (unlexicalized) hand-crafted features with learned word embeddings. The model is able to directly tackle the difficulties met by traditional compositional embeddings models, such as handling arbitrary types of sentence annotations and utilizing global information for composition. We test the proposed model on two relation extraction tasks, and demonstrate that our model outperforms both previous compositional models and traditional feature rich models on the ACE 2005 relation extraction task, and the SemEval 2010 relation classification task. The combination of our model and a log-linear classifier with hand-crafted features gives state-of-the-art results.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Relation Classification | SemEval-2010 Task 8 (test) | F1 Score83.4 | 128 | |
| Relation Extraction | NYT (test) | F1 Score24 | 85 | |
| Relation Extraction | ACE05 (test) | -- | 72 | |
| Relation Extraction | Wiki-KBP (test) | F1 Score30.1 | 59 | |
| Relation Extraction | ACE bc 2005 (test) | Precision74.39 | 22 | |
| Relation Extraction | ACE out-of-domain cts 2005 (test) | Precision74.53 | 14 | |
| Relation Extraction | BioInfer (test) | Precision0.535 | 11 | |
| Relation Extraction | ACE wl domain 2005 (test) | Precision65.63 | 10 | |
| Relation Classification | NYT (test) | Accuracy68.8 | 10 | |
| Relation Classification | Wiki-KBP (test) | Accuracy61.7 | 10 |