PromptRank: Unsupervised Keyphrase Extraction Using Prompt

About

Keyphrase extraction is the task of automatically selecting phrases from a document that summarize its core content. Embedding-based algorithms, which rank candidates by the similarity of their embeddings to the document embedding, have recently achieved state-of-the-art (SOTA) performance. However, such solutions either struggle with the length discrepancy between document and candidates or fail to fully utilize the pre-trained language model (PLM) without further fine-tuning. To this end, we propose PromptRank, a simple yet effective unsupervised approach based on a PLM with an encoder-decoder architecture. Specifically, PromptRank feeds the document into the encoder and uses the decoder to calculate the probability of generating each candidate with a designed prompt. We extensively evaluate PromptRank on six widely used benchmarks. It outperforms the SOTA approach MDERank, with relative F1 improvements of 34.18%, 24.87%, and 17.57% for 5, 10, and 15 returned results, respectively. This demonstrates the great potential of prompts for unsupervised keyphrase extraction. We release our code at https://github.com/HLT-NLP/PromptRank.
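The ranking step described in the abstract can be sketched roughly as follows. This is an illustration only, not the authors' implementation (the linked repository is the reference for that). `toy_token_log_prob` is a hypothetical stand-in for the decoder of an encoder-decoder PLM, the prompt string is an assumed placeholder, and averaging log-probabilities over tokens is a choice made for this sketch so that candidates of different lengths stay comparable.

```python
import math
from collections import Counter


def toy_token_log_prob(token, document_tokens, prefix_tokens):
    """Hypothetical stand-in for a PLM decoder (assumption, not a real model).

    A real system would condition on the encoded document and on the decoder
    prefix. This toy uses add-one-smoothed word frequency in the document and
    ignores the prefix entirely.
    """
    counts = Counter(document_tokens)
    vocab_size = len(counts) + 1  # +1 reserves probability mass for unseen tokens
    return math.log((counts[token] + 1) / (len(document_tokens) + vocab_size))


def candidate_score(candidate, document, prompt="This text mainly talks about"):
    """Mean token log-probability of generating `candidate` after `prompt`."""
    doc_tokens = document.lower().split()
    prefix = prompt.lower().split()  # placeholder prompt, assumed for illustration
    cand_tokens = candidate.lower().split()
    if not cand_tokens:
        return float("-inf")
    total = 0.0
    for token in cand_tokens:
        total += toy_token_log_prob(token, doc_tokens, prefix)
        prefix.append(token)  # the decoder prefix grows as tokens are generated
    return total / len(cand_tokens)


def rank_candidates(candidates, document, top_k=5):
    """Return the `top_k` candidates with the highest generation score."""
    ranked = sorted(candidates, key=lambda c: candidate_score(c, document), reverse=True)
    return ranked[:top_k]
```

Calling `rank_candidates(["banana", "keyphrase extraction"], document)` on a document that mentions keyphrase extraction places the in-document phrase first. Swapping the toy scorer for real decoder log-probabilities would leave the ranking logic unchanged.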

Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xiaoyan Bai • 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Multi-hop Question Answering | HotpotQA (test) | -- | -- | 198 |
| Keyword Extraction | SemEval 2010 | F1 Score (k=10) | 20.66 | 31 |
| Keyphrase Extraction | SemEval 2017 | F1@5 | 27.14 | 23 |
| Keyphrase Generation | KP20k (test) | SemP | 41.4 | 23 |
| Keyphrase Generation | KPTimes (test) | Semantic Precision (SemP) | 42.7 | 21 |
| Keyphrase Extraction | Inspec (test) | F1@5 | 31.73 | 15 |
| Keyphrase Extraction | SemEval 2010 (test) | F1@5 | 17.24 | 14 |
| Keyphrase Extraction | Krapivin (test) | F1@5 | 16.11 | 11 |
| Keyphrase Extraction | Inspec | F1 Score @ 5 | 31.73 | 9 |
| Keyphrase Extraction | DUC 2001 | F1 Score @ 5 | 27.39 | 9 |

Showing 10 of 12 rows
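
The F1@k figures in the table follow the usual precision/recall definition over the top-k returned keyphrases. Below is a minimal sketch, assuming exact string match after lowercasing. Benchmarks commonly also apply stemming, which is omitted here, and the function name and example phrases are hypothetical.

```python
def f1_at_k(predicted, gold, k):
    """F1 between the top-k predicted keyphrases and the gold set (exact match)."""
    top_k = [p.lower().strip() for p in predicted[:k]]
    gold_set = {g.lower().strip() for g in gold}
    if not top_k or not gold_set:
        return 0.0
    correct = len(set(top_k) & gold_set)
    if correct == 0:
        return 0.0
    precision = correct / len(top_k)   # share of returned phrases that are correct
    recall = correct / len(gold_set)   # share of gold phrases that were returned
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: 2 of 5 predictions match 2 of 3 gold phrases.
predicted = ["neural network", "keyphrase extraction", "prompt", "encoder", "decoder"]
gold = ["keyphrase extraction", "prompt", "language model"]
score = f1_at_k(predicted, gold, k=5)  # precision 2/5, recall 2/3
```

Dataset-level scores such as those in the table are typically obtained by averaging this per-document value over the test set and reporting it as a percentage.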
