Zero-Shot Multi-Label Topic Inference with Sentence Encoders

About

Sentence encoders have indeed been shown to achieve superior performances for many downstream text-mining tasks and, thus, claimed to be fairly general. Inspired by this, we performed a detailed study on how to leverage these sentence encoders for the "zero-shot topic inference" task, where the topics are defined/provided by the users in real-time. Extensive experiments on seven different datasets demonstrate that Sentence-BERT demonstrates superior generality compared to other encoders, while Universal Sentence Encoder can be preferred when efficiency is a top priority.

Souvika Sarkar, Dongji Feng, Shubhra Kanti Karmaker Santu• 2023

Related benchmarks

Task	Dataset	Result
Multi-Label Classification	medical	Micro F1-Score59.4	11
Classification	medical	F1 Score59.4	10
Classification	News	F1 Score51.2	3
Multi-label topic classification	News	Micro F1 Score51.2	3
Multi-label topic classification	Cell. phone	Micro F152	3
Multi-label topic classification	Digital Camera 1	Micro-Avg F1 Score50	3
Multi-label topic classification	DVD player	Micro-average F150.1	3
Multi-label topic classification	SemEval	Micro-F155	3
Classification	Cellular phone	F1 Score52	2
Classification	Digital cam 1	F1 Score50	2

Showing 10 of 16 rows

Other info

Follow for update

@wizwand_team Discord