GPT-NeoX-20B: An Open-Source Autoregressive Language Model

About

We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license. It is, to the best of our knowledge, the largest dense autoregressive model that has publicly available weights at the time of submission. In this work, we describe GPT-NeoX-20B's architecture and training and evaluate its performance on a range of language-understanding, mathematics, and knowledge-based tasks. We find that GPT-NeoX-20B is a particularly powerful few-shot reasoner and gains far more in performance when evaluated five-shot than similarly sized GPT-3 and FairSeq models. We open-source the training and evaluation code, as well as the model weights, at https://github.com/EleutherAI/gpt-neox.
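
To make "evaluated five-shot" concrete, here is a minimal sketch of how a k-shot prompt is commonly assembled: k solved examples are placed before the unanswered query, and the model is asked to continue the text. The function name `build_few_shot_prompt`, the "Question:/Answer:" template, and the toy examples are assumptions made for illustration; they are not taken from the paper or from its evaluation code.

```python
from typing import List, Tuple


def build_few_shot_prompt(
    examples: List[Tuple[str, str]], query: str, k: int = 5
) -> str:
    """Return a prompt with the first k solved examples followed by the query.

    The template is a hypothetical one chosen for illustration only.
    """
    # Each solved example becomes one "shot" shown to the model.
    shots = [f"Question: {q}\nAnswer: {a}" for q, a in examples[:k]]
    # The query is left unanswered so the model completes it.
    shots.append(f"Question: {query}\nAnswer:")
    return "\n\n".join(shots)


# Toy data, invented for this sketch.
demo_examples = [
    ("What is 2 + 2?", "4"),
    ("What is 3 + 5?", "8"),
    ("What is 7 - 4?", "3"),
    ("What is 6 * 2?", "12"),
    ("What is 9 / 3?", "3"),
]

five_shot_prompt = build_few_shot_prompt(demo_examples, "What is 5 + 6?", k=5)
zero_shot_prompt = build_few_shot_prompt(demo_examples, "What is 5 + 6?", k=0)
```

With `k=0` the same function yields the zero-shot prompt, so the only difference between the two settings is how many solved examples precede the query.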

Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, USVSN Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach • 2022

Related benchmarks

Task	Dataset	Result
Commonsense Reasoning	HellaSwag	Accuracy 68.37	1896
Question Answering	ARC Challenge	--	906
Multi-task Language Understanding	MMLU	--	881
Commonsense Reasoning	PIQA	Accuracy 77.9	757
Code Generation	HumanEval (test)	Pass@1 15.4	612
Natural Language Inference	RTE	Accuracy 53.79	590
Named Entity Recognition	CoNLL 2003 (test)	--	556
Question Answering	OpenBookQA	Accuracy 32.6	465
Mathematical Reasoning	MATH (test)	--	433
Physical Interaction Question Answering	PIQA	Accuracy 77.4	415

Showing 10 of 86 rows
