Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Emergent Abilities of Large Language Models

About

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.

Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus• 2022

Related benchmarks

TaskDatasetResultRank
Semantic Antonym PredictionAntonym
Accuracy67
44
Machine TranslationEnglish-French
Accuracy74.5
42
Knowledge Retrieval / Relation PredictionPerson-Instrument
Accuracy0.75
30
In-Context LearningNLP Task Suite (Capitalize, Country-Capital, Present-Past, Singular-Plural, Person-Sport, AG News) (test)
Capitalize99.9
20
Word Relation PredictionLandmark-Continent
Accuracy87
20
Word Relation PredictionPerson-Occupation
Accuracy56.1
20
Word Relation PredictionProduct-Company
Accuracy80.8
20
Showing 7 of 7 rows

Other info

Follow for update