Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unsupervised Open-domain Keyphrase Generation

About

In this work, we study the problem of unsupervised open-domain keyphrase generation, where the objective is a keyphrase generation model that can be built without using human-labeled data and can perform consistently across domains. To solve this problem, we propose a seq2seq model that consists of two modules, namely \textit{phraseness} and \textit{informativeness} module, both of which can be built in an unsupervised and open-domain fashion. The phraseness module generates phrases, while the informativeness module guides the generation towards those that represent the core concepts of the text. We thoroughly evaluate our proposed method using eight benchmark datasets from different domains. Results on in-domain datasets show that our approach achieves state-of-the-art results compared with existing unsupervised models, and overall narrows the gap between supervised and unsupervised methods down to about 16\%. Furthermore, we demonstrate that our model performs consistently across domains, as it overall surpasses the baselines on out-of-domain datasets.

Lam Thanh Do, Pritom Saha Akash, Kevin Chen-Chuan Chang• 2023

Related benchmarks

TaskDatasetResultRank
Present Keyphrase PredictionKrapivin
F1@521.4
15
Present Keyphrase GenerationStackExchange
F1@327.2
8
Present Keyphrase GenerationOpenKP
F1@315.98
8
Present Keyphrase GenerationNUS
F1@30.264
8
Present Keyphrase GenerationDUC 2001
F1 Score @ 315.23
8
Present Keyphrase GenerationKPTimes
F1@320.11
8
Present Keyphrase GenerationSemEval
F1@319.1
8
Present Keyphrase GenerationInspec
F1@319.8
8
Absent Keyphrase GenerationInspec--
7
Absent Keyphrase GenerationKPTimes
R@5310
4
Showing 10 of 15 rows

Other info

Code

Follow for update