Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

An Empirical Study on Information Extraction using Large Language Models

About

Human-like large language models (LLMs), especially the most powerful and popular ones in OpenAI's GPT family, have proven to be very helpful for many natural language processing (NLP) related tasks. Therefore, various attempts have been made to apply LLMs to information extraction (IE), which is a fundamental NLP task that involves extracting information from unstructured plain text. To demonstrate the latest representative progress in LLMs' information extraction ability, we assess the information extraction ability of GPT-4 (the latest version of GPT at the time of writing this paper) from four perspectives: Performance, Evaluation Criteria, Robustness, and Error Types. Our results suggest a visible performance gap between GPT-4 and state-of-the-art (SOTA) IE methods. To alleviate this problem, considering the LLMs' human-like characteristics, we propose and analyze the effects of a series of simple prompt-based methods, which can be generalized to other LLMs and NLP tasks. Rich experiments show our methods' effectiveness and some of their remaining issues in improving GPT-4's information extraction ability.

Ridong Han, Chaohao Yang, Tao Peng, Prayag Tiwari, Xiang Wan, Lu Liu, Benyou Wang• 2023

Related benchmarks

TaskDatasetResultRank
aspect sentiment triplet extractionLap14 D20b (test)
F1 Score39.01
15
aspect sentiment triplet extractionRes15 D20b (test)
F1 Score47.88
15
aspect sentiment triplet extractionRes16 D20b (test)
F1-score56.55
15
aspect sentiment triplet extractionRes14 D20b (test)
F1-score54.89
14
Aspect Sentiment Triplet Extraction (ASTE)D20a Lap14
F1 Score44.63
12
Aspect Sentiment Triplet Extraction (ASTE)D20a Res14
F1 Score63.57
12
Aspect Sentiment Triplet Extraction (ASTE)D20a Res15
F1-score56.53
12
Aspect Sentiment Triplet Extraction (ASTE)D20a Res16
F1-score64.68
12
Aspect-Opinion Pair Extraction (AOPE)D20a Res15
F1-score65.71
8
Aspect-Opinion Pair Extraction (AOPE)D20a Lap14
F1-score52.7
8
Showing 10 of 12 rows

Other info

Follow for update