Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

About

Reasoning is a fundamental component of language understanding. Recent prompting techniques, such as chain of thought, have consistently improved LLMs' performance on various reasoning tasks. Nevertheless, there is still little understanding of what triggers reasoning abilities in LLMs at inference time. In this paper, we introduce code prompting, a chain of prompts that transforms a natural language problem into code and then prompts the LLM directly with the generated code, without resorting to external code execution. We hypothesize that code prompts can elicit certain reasoning capabilities of LLMs trained on text and code, and we use the proposed method to improve conditional reasoning, the ability to infer different conclusions depending on the fulfillment of certain conditions. We find that code prompting yields a substantial performance boost for multiple LLMs (up to 22.52 percentage points on GPT 3.5, 7.75 on Mixtral, and 16.78 on Mistral) across multiple conditional reasoning datasets. We then conduct comprehensive experiments to understand how code prompts trigger reasoning abilities and which capabilities are elicited in the underlying models. Our analysis of GPT 3.5 reveals that the code formatting of the input problem is essential for the performance improvement. Furthermore, code prompts improve the sample efficiency of in-context learning and facilitate state tracking of variables or entities.
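
The two-step chain described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the instruction text, the function name `code_prompting`, the answer cue, and the `llm` callable (any function that maps a prompt string to a completion string) are all assumptions introduced here. The paper's actual prompts are not reproduced on this page.

```python
from typing import Callable

# Hypothetical instruction text (an assumption for illustration only).
TRANSFORM_INSTRUCTION = (
    "Rewrite the following document and question as Python-like code. "
    "Keep the original sentences as comments, create a variable for each "
    "key entity, and express each condition as an if statement.\n\n"
)


def code_prompting(document: str, question: str, llm: Callable[[str], str]) -> str:
    """Sketch of a two-prompt chain: text -> code, then code -> answer."""
    # Step 1: ask the model to transform the natural language problem into code.
    problem = f"Document: {document}\nQuestion: {question}\n"
    generated_code = llm(TRANSFORM_INSTRUCTION + problem)

    # Step 2: prompt the model with the generated code itself.
    # The code is never executed; the model reasons over it as text.
    answer_prompt = generated_code + "\n# Answer to the question:\n"
    return llm(answer_prompt)
```

Passing the model in as a plain callable keeps the sketch independent of any particular API client; the same chain can be tried with any text+code LLM by swapping the callable.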

Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych • 2024

Related benchmarks

Task | Dataset | Result | Rank
Commonsense Reasoning | Commonsense Reasoning Suite (test) | HellaSwag Accuracy: 0.58 | 62
Instruction Following | Instruction-Following Suite | IFEval Score: 52 | 18
General LLM Evaluation | Instruction-Following, Mathematics, and Commonsense Reasoning Combined | Average Score: 47 | 18
Mathematics | Mathematics Suite | GSM8K Accuracy: 30 | 18