Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Short Text Clustering with Transformers

About

Recent techniques for the task of short text clustering often rely on word embeddings as a transfer learning component. This paper shows that sentence vector representations from Transformers in conjunction with different clustering methods can be successfully applied to address the task. Furthermore, we demonstrate that the algorithm of enhancement of clustering via iterative classification can further improve initial clustering performance with different classifiers, including those based on pre-trained Transformer language models.

Leonid Pugachev, Mikhail Burtsev• 2021

Related benchmarks

TaskDatasetResultRank
Short Text ClusteringTweet--
28
Short Text ClusteringAG News (test)
Accuracy86.53
18
Short Text ClusteringStack Overflow (test)
Accuracy84.72
5
Short Text ClusteringSearch Snippets (test)
Accuracy87.67
5
Short Text ClusteringBiomedical corpus (test)
Accuracy47.78
5
Short Text ClusteringGoogle News Title only--
5
Short Text ClusteringGoogle News Title and Snippet--
1
Short Text ClusteringGoogle News S Snippet only--
1
Showing 8 of 8 rows

Other info

Follow for update