Emergent Abilities of Large Language Models

About

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.

Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus• 2022

Related benchmarks

Task	Dataset	Result
Classification	Credit-g	ROC AUC0.5529	53
Classification	blood	ROC-AUC0.5899	47
Semantic Antonym Prediction	Antonym	Accuracy67	44
Machine Translation	English-French	Accuracy74.5	42
Tabular Classification	Heart	Mean AUC-ROC67	31
Knowledge Retrieval / Relation Prediction	Person-Instrument	Accuracy0.75	30
Tabular Classification	Diabetes	AUC72.21	24
Tabular Classification	Adult	AUC0.795	24
Tabular Classification	AMAZON	AUC0.4885	24
In-Context Learning	NLP Task Suite (Capitalize, Country-Capital, Present-Past, Singular-Plural, Person-Sport, AG News) (test)	Capitalize99.9	20

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord