HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification

About

Hierarchical text classification (HTC) is a challenging subtask of multi-label classification due to its complex label hierarchy. Recently, pretrained language models (PLMs) have been widely adopted in HTC through a fine-tuning paradigm. However, in this paradigm there is a large gap between classification tasks with a sophisticated label hierarchy and the masked language model (MLM) pretraining tasks of PLMs, so the potential of PLMs cannot be fully tapped. To bridge this gap, we propose HPT, a Hierarchy-aware Prompt Tuning method that handles HTC from a multi-label MLM perspective. Specifically, we construct a dynamic virtual template and label words that take the form of soft prompts to fuse the label hierarchy knowledge, and we introduce a zero-bounded multi-label cross entropy loss to harmonize the objectives of HTC and MLM. Extensive experiments show that HPT achieves state-of-the-art performance on three popular HTC datasets and is adept at handling imbalanced and low-resource settings. Our code is available at https://github.com/wzh9969/HPT.

Zihan Wang, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui, Houfeng Wang • 2022
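
The abstract names a zero-bounded multi-label cross entropy loss but does not give its formula. As a rough illustration only, the sketch below assumes the commonly used zero-threshold form log(1 + sum over negative labels of e^s) + log(1 + sum over positive labels of e^-s), where a label counts as predicted when its score s is above 0. The function names and the plain-list interface are hypothetical and are not taken from the HPT repository.

```python
import math


def _log1p_sum_exp(values):
    """Stable log(1 + sum(exp(v) for v in values)); returns 0.0 for an empty list."""
    m = max([0.0] + list(values))
    return m + math.log(math.exp(-m) + sum(math.exp(v - m) for v in values))


def zero_bounded_multilabel_ce(scores, targets):
    """Assumed form of a zero-bounded multi-label cross entropy loss.

    scores:  one real-valued score per label (hypothetical interface)
    targets: 0/1 indicators of the gold labels, same length as scores
    The loss pushes positive-label scores above 0 and negative-label
    scores below 0, so 0 serves as the decision threshold.
    """
    neg = [s for s, t in zip(scores, targets) if t == 0]
    pos = [-s for s, t in zip(scores, targets) if t == 1]
    return _log1p_sum_exp(neg) + _log1p_sum_exp(pos)


if __name__ == "__main__":
    # Uninformative scores: each term contributes log(2).
    print(zero_bounded_multilabel_ce([0.0, 0.0], [1, 0]))
    # Confidently correct scores give a loss close to zero.
    print(zero_bounded_multilabel_ce([10.0, -10.0], [1, 0]))
```

Under this assumed form, no per-label threshold needs tuning at inference time: every label with a positive score is emitted, which is what lets the classification objective sit on top of MLM-style vocabulary scores.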

Related benchmarks

Task	Dataset	Result
Hierarchical Text Classification	WOS	Macro-F1 81.24	78
Hierarchical Text Classification	RCV1 v2	Macro-F1 69.16	68
Hierarchical Text Classification	AAPD	Macro-F1 62.94	33
Hierarchical Text Classification	NYT	Macro-F1 45.21	31
Hierarchical Text Classification	Web-of-Science (WOS) Depth 2 (test)	Micro-F1 79.85	25
Hierarchical Text Classification	DBpedia Depth 3	Macro-F1 93.34	24
Hierarchical Text Classification	RCV1 Depth 4 V2	Micro-F1 65.73	20
Hierarchical Text Classification	WOS few-shot	Micro-F1 80.69	20
Hierarchical Text Classification	BGC	Macro-F1 52.9	18
Hierarchical Text Classification	WOS full-shot	Micro-F1 87.1	5
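
The table reports both Micro-F1 and Macro-F1. For readers comparing the two, here is a minimal sketch of how they are usually computed for multi-label predictions. The helper name and the set-based input format are illustrative assumptions, and the convention of scoring a label with no gold or predicted instances as 0 is one of several in use.

```python
def micro_macro_f1(gold, pred, num_labels):
    """Return (micro_f1, macro_f1) for multi-label predictions.

    gold, pred: lists of sets of label ids, one set per document
    num_labels: total number of labels in the hierarchy
    """
    tp = [0] * num_labels
    fp = [0] * num_labels
    fn = [0] * num_labels
    for g, p in zip(gold, pred):
        for label in range(num_labels):
            if label in p and label in g:
                tp[label] += 1
            elif label in p:
                fp[label] += 1
            elif label in g:
                fn[label] += 1

    def f1(t, false_pos, false_neg):
        denom = 2 * t + false_pos + false_neg
        # Assumption: a label with no gold and no predicted instances scores 0.
        return 2 * t / denom if denom else 0.0

    # Micro-F1 pools counts over all labels, so frequent labels dominate.
    micro = f1(sum(tp), sum(fp), sum(fn))
    # Macro-F1 averages per-label F1, so every label weighs the same.
    macro = sum(f1(tp[i], fp[i], fn[i]) for i in range(num_labels)) / num_labels
    return micro, macro


if __name__ == "__main__":
    gold = [{0, 1}, {0}, {0, 2}]
    pred = [{0, 1}, {0}, {0}]  # the rare label 2 is missed
    print(micro_macro_f1(gold, pred, num_labels=3))
```

In this toy example a single missed rare label lowers Macro-F1 much more than Micro-F1, which is why Macro-F1 is the more sensitive number on imbalanced label hierarchies.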

