OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System
About
We introduce OneKE, a dockerized schema-guided knowledge extraction system, which can extract knowledge from the Web and raw PDF Books, and support various domains (science, news, etc.). Specifically, we design OneKE with multiple agents and a configure knowledge base. Different agents perform their respective roles, enabling support for various extraction scenarios. The configure knowledge base facilitates schema configuration, error case debugging and correction, further improving the performance. Empirical evaluations on benchmark datasets demonstrate OneKE's efficacy, while case studies further elucidate its adaptability to diverse tasks across multiple domains, highlighting its potential for broad applications. We have open-sourced the Code at https://github.com/zjunlp/OneKE and released a Video at http://oneke.openkg.cn/demo.mp4.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Relation Extraction | Wiki-ZSL (test) | Micro-F142.18 | 16 | |
| Relation Extraction | FewRel (test) | Micro-F139.19 | 16 | |
| Event extraction | RAMS (test) | Micro-F122.58 | 13 | |
| Named Entity Recognition | CrossNER (test) | Micro-F160.91 | 13 | |
| Event extraction | WikiEvents (test) | Micro-F112.43 | 13 | |
| Event extraction | CCF Law and FewFC (test) | CCF Law Score62.19 | 11 | |
| Relation Extraction | COAE2016 and SKE2020 (test) | COAE2016 Performance49.83 | 11 | |
| Named Entity Recognition | Weibo and Boson (test) | Weibo NER Score35.06 | 11 | |
| Cyber Threat Intelligence Named Entity Recognition | DNRTI (test) | Macro F121.57 | 5 | |
| Cyber Threat Intelligence Named Entity Recognition | LADDER | Macro F144.16 | 5 |