Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

About

One of the primary driving forces contributing to the superior performance of Large Language Models (LLMs) is the extensive availability of human-annotated natural language data, which is used for alignment fine-tuning. This inspired researchers to investigate self-training methods to mitigate the extensive reliance on human annotations. However, the current success of self-training has been primarily observed in natural language scenarios, rather than in the increasingly important neural-symbolic scenarios. To this end, we propose an environment-guided neural-symbolic self-training framework named ENVISIONS. It aims to overcome two main challenges: (1) the scarcity of symbolic data, and (2) the limited proficiency of LLMs in processing symbolic language. Extensive evaluations conducted on three distinct domains demonstrate the effectiveness of our approach. Additionally, we have conducted a comprehensive analysis to uncover the factors contributing to ENVISIONS's success, thereby offering valuable insights for future research in this area. Code will be available at \url{https://github.com/xufangzhi/ENVISIONS}.

Fangzhi Xu, Qiushi Sun, Kanzhi Cheng, Jun Liu, Yu Qiao, Zhiyong Wu• 2024

Related benchmarks

TaskDatasetResultRank
AgentMiniWob++ (held-in)
Performance (%)87.12
14
Logical reasoningProofWriter (held-out)
Performance0.5483
14
Logical reasoningRuleTaker (held-out)
Performance (%)62.63
14
Math ReasoningGSM8K (held-in)
Performance (%)68.31
14
Math ReasoningMATH (held-out)
Performance26.04
14
Math ReasoningGSM-H (held-out)
Accuracy (%)57.54
14
Math ReasoningSVAMP (held-out)
Performance78.3
14
Math ReasoningASDiv (held-out)
Performance75.52
14
Showing 8 of 8 rows

Other info

Code

Follow for update