Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

About

One of the primary driving forces behind the superior performance of Large Language Models (LLMs) is the extensive availability of human-annotated natural language data, which is used for alignment fine-tuning. This has inspired researchers to investigate self-training methods that reduce the heavy reliance on human annotations. However, the success of self-training so far has been observed primarily in natural language scenarios, rather than in the increasingly important neural-symbolic scenarios. To this end, we propose an environment-guided neural-symbolic self-training framework named ENVISIONS. It aims to overcome two main challenges: (1) the scarcity of symbolic data, and (2) the limited proficiency of LLMs in processing symbolic language. Extensive evaluations conducted on three distinct domains demonstrate the effectiveness of our approach. Additionally, we have conducted a comprehensive analysis to uncover the factors contributing to ENVISIONS's success, thereby offering valuable insights for future research in this area. Code will be available at https://github.com/xufangzhi/ENVISIONS.

Fangzhi Xu, Qiushi Sun, Kanzhi Cheng, Jun Liu, Yu Qiao, Zhiyong Wu • 2024

Related benchmarks

Task	Dataset	Metric	Result
Agent	MiniWob++ (held-in)	Performance (%)	87.12	14
Logical reasoning	ProofWriter (held-out)	Performance	0.5483	14
Logical reasoning	RuleTaker (held-out)	Performance (%)	62.63	14
Math Reasoning	GSM8K (held-in)	Performance (%)	68.31	14
Math Reasoning	MATH (held-out)	Performance	26.04	14
Math Reasoning	GSM-H (held-out)	Accuracy (%)	57.54	14
Math Reasoning	SVAMP (held-out)	Performance	78.3	14
Math Reasoning	ASDiv (held-out)	Performance	75.52	14
