Zero-Shot Multi-Label Topic Inference with Sentence Encoders
About
Sentence encoders have indeed been shown to achieve superior performances for many downstream text-mining tasks and, thus, claimed to be fairly general. Inspired by this, we performed a detailed study on how to leverage these sentence encoders for the "zero-shot topic inference" task, where the topics are defined/provided by the users in real-time. Extensive experiments on seven different datasets demonstrate that Sentence-BERT demonstrates superior generality compared to other encoders, while Universal Sentence Encoder can be preferred when efficiency is a top priority.
Souvika Sarkar, Dongji Feng, Shubhra Kanti Karmaker Santu• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Label Classification | medical | Micro F1-Score59.4 | 11 | |
| Classification | medical | F1 Score59.4 | 10 | |
| Classification | News | F1 Score51.2 | 3 | |
| Multi-label topic classification | News | Micro F1 Score51.2 | 3 | |
| Multi-label topic classification | Cell. phone | Micro F152 | 3 | |
| Multi-label topic classification | Digital Camera 1 | Micro-Avg F1 Score50 | 3 | |
| Multi-label topic classification | DVD player | Micro-average F150.1 | 3 | |
| Multi-label topic classification | SemEval | Micro-F155 | 3 | |
| Classification | Cellular phone | F1 Score52 | 2 | |
| Classification | Digital cam 1 | F1 Score50 | 2 |
Showing 10 of 16 rows