Self-Instruct: Aligning Language Models with Self-Generated Instructions

About

Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off their own generations. Our pipeline generates instructions, input, and output samples from a language model, then filters invalid or similar ones before using them to finetune the original model. Applying our method to the vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT-001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning. Our code and data are available at https://github.com/yizhongw/self-instruct.

Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi• 2022

Related benchmarks

Task	Dataset	Result
Visual Question Answering	TextVQA	Accuracy81.83	1453
Visual Question Answering	VQA v2	--	1429
Mathematical Reasoning	MATH	Accuracy7.13	882
Language Understanding	MMLU	Accuracy70.2	844
Visual Question Answering	GQA	Accuracy59.57	524
Code Generation	MBPP (test)	--	405
Mathematical Reasoning	MathVista	Accuracy66.5	382
Mathematical Reasoning	Minerva	Pass@1 Accuracy35.79	289
Mathematical Reasoning	AIME 2024	Pass@1 Accuracy22	236
Mathematical Reasoning	GSM8K	--	204

Showing 10 of 71 rows

...

Other info

Follow for update

@wizwand_team Discord