Universal Language Model Fine-tuning for Text Classification
About
Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We open-source our pretrained models and code.
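One of the key fine-tuning techniques the paper introduces is the slanted triangular learning rate (STLR) schedule: the learning rate first increases linearly for a short fraction of training, then decays linearly. A minimal sketch of that schedule, following the formula in the paper (the default hyperparameter values below match those reported there; the function name is illustrative):

```python
import math

def slanted_triangular_lr(t, T, lr_max=0.01, cut_frac=0.1, ratio=32):
    """Slanted triangular learning-rate schedule (ULMFiT).

    t        -- current training iteration (0-based)
    T        -- total number of training iterations
    lr_max   -- peak learning rate
    cut_frac -- fraction of iterations spent increasing the rate
    ratio    -- how much smaller the lowest rate is than lr_max
    """
    cut = math.floor(T * cut_frac)
    if t < cut:
        p = t / cut                                   # linear warm-up
    else:
        p = 1 - (t - cut) / (cut * (1 / cut_frac - 1))  # linear decay
    return lr_max * (1 + p * (ratio - 1)) / ratio
```

With `T=100` iterations, the rate climbs from `lr_max/ratio` to `lr_max` over the first 10 steps, then decays back down. The paper pairs this schedule with discriminative fine-tuning, where each layer gets its own (smaller) learning rate.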
Jeremy Howard, Sebastian Ruder • 2018
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Mathematical Reasoning | GSM8K (test) | Accuracy | 63 | 797 |
| Image Classification | DTD | Accuracy | 67.85 | 419 |
| Image Classification | SVHN | Accuracy | 94 | 359 |
| Image Classification | FGVCAircraft | Accuracy | 54.77 | 225 |
| Text Classification | AG News (test) | Accuracy | 84.14 | 210 |
| Text Classification | TREC | Accuracy | 96.4 | 179 |
| Sentiment Classification | IMDB (test) | Error Rate | 4.6 | 144 |
| Text Classification | Yahoo! Answers (test) | Clean Accuracy | 64.27 | 133 |
| Image Classification | VTAB 1k (test) | -- | -- | 121 |
| Text Classification | AGNews | Accuracy | 95 | 119 |