Deep Unknown Intent Detection with Margin Loss
About
Identifying the unknown (novel) user intents that have never appeared in the training set is a challenging task in the dialogue system. In this paper, we present a two-stage method for detecting unknown intents. We use bidirectional long short-term memory (BiLSTM) network with the margin loss as the feature extractor. With margin loss, we can learn discriminative deep features by forcing the network to maximize inter-class variance and to minimize intra-class variance. Then, we feed the feature vectors to the density-based novelty detection algorithm, local outlier factor (LOF), to detect unknown intents. Experiments on two benchmark datasets show that our method can yield consistent improvements compared with the baseline methods.
Ting-En Lin, Hua Xu• 2019
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Unknown Intent Detection | ATIS (test) | Macro F169.6 | 20 | |
| Unknown Intent Detection | Snips (test) | Macro F184.1 | 15 | |
| Unknown Intent Detection | StackOverflow 50% seen classes (test) | Accuracy64.34 | 11 | |
| Open Intent Classification | BANKING 50% known classes (test) | Accuracy83.35 | 10 | |
| OOD Intent Detection | SNIPS standard (test) | Macro F177.7 | 10 | |
| OOD Intent Detection | Persian-ATIS standard (test) | Macro F117.81 | 10 | |
| Unknown Intent Detection | M-CID-EN 75% seen classes (test) | Accuracy77.11 | 6 | |
| Unknown Intent Detection | Banking 25% seen classes (test) | Accuracy52.77 | 6 | |
| Unknown Intent Detection | CLINC150 50% seen classes (test) | Accuracy78.63 | 6 | |
| Unknown Intent Detection | CLINC150 75% seen classes (test) | Accuracy84.59 | 6 |
Showing 10 of 24 rows