Magicoder: Empowering Code Generation with OSS-Instruct

About

We introduce Magicoder, a series of fully open-source (code, weights, and data) Large Language Models (LLMs) for code that significantly closes the gap with top code models while having no more than 7B parameters. Magicoder models are trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets to generate diverse instruction data for code. Our main motivation is to mitigate the inherent bias of the synthetic data generated by LLMs through the wealth of open-source references for the production of more realistic and controllable data. The orthogonality of OSS-Instruct and other data generation methods like Evol-Instruct further enables us to build an enhanced MagicoderS. Both Magicoder and MagicoderS substantially outperform state-of-the-art code models with similar or even larger sizes on a wide range of coding benchmarks. Notably, MagicoderS-CL-7B based on CodeLlama even surpasses the prominent ChatGPT on HumanEval+ (66.5 vs. 65.9 in pass@1 ). Overall, OSS-Instruct opens a new direction for crafting diverse synthetic instruction data for code using abundant open-source references.

Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang• 2023

Related benchmarks

Task	Dataset	Result
Code Generation	HumanEval	Pass@176.8	1048
Mathematical Reasoning	GSM8K (test)	Accuracy18.8	816
Code Generation	HumanEval (test)	Pass@176.8	701
Code Generation	MBPP (test)	--	411
Code Generation	HumanEval+	Pass@165.2	393
Multitask Language Understanding	MMLU (test)	Accuracy34.4	322
Code Generation	MBPP+	Pass@156.1	238
Code Generation	MBPP	Pass@164.2	193
Code Generation	HumanEval 1.0 (test)	Pass@10.732	145
Reasoning	BBH (test)	Accuracy43.8	94

Showing 10 of 21 rows

Other info

Follow for update

@wizwand_team Discord