Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery

About

New intent discovery is of great value to natural language processing, allowing for a better understanding of user needs and providing friendly services. However, most existing methods struggle to capture the complicated semantics of discrete text representations when limited or no prior knowledge of labeled data is available. To tackle this problem, we propose a novel clustering framework, USNID, for unsupervised and semi-supervised new intent discovery, which has three key technologies. First, it fully utilizes unsupervised or semi-supervised data to mine shallow semantic similarity relations and provide well-initialized representations for clustering. Second, it designs a centroid-guided clustering mechanism to address the issue of cluster allocation inconsistency and provide high-quality self-supervised targets for representation learning. Third, it captures high-level semantics in unsupervised or semi-supervised data to discover fine-grained intent-wise clusters by optimizing both cluster-level and instance-level objectives. We also propose an effective method for estimating the cluster number in open-world scenarios without knowing the number of new intents beforehand. USNID performs exceptionally well on several benchmark intent datasets, achieving new state-of-the-art results in unsupervised and semi-supervised new intent discovery and demonstrating robust performance with different cluster numbers.

Hanlei Zhang, Hua Xu, Xin Wang, Fei Long, Kai Gao• 2023

Related benchmarks

TaskDatasetResultRank
New Intent DiscoveryBANKING
NMI87.67
76
New Intent DiscoveryM-CID
NMI79.04
75
Generalized Category DiscoveryBanking (test)
Accuracy73.27
28
Generalized Category DiscoveryCLINC (test)
Accuracy87.22
28
Generalized Category DiscoveryStackOverflow (test)
Accuracy82.06
28
New Intent DiscoveryStackOverflow
NMI80.01
27
New Intent DiscoveryCLINC
NMI96.46
20
New Intent DiscoverySNIPS
NMI93.32
19
New Intent DiscoveryDBpedia
NMI86.29
19
Multimodal semantics discoveryMIntRec (test)
NMI47.91
6
Showing 10 of 12 rows

Other info

Follow for update