Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Formula-One Prompting: A Composable Equation-First Prefix for Applied Mathematics

About

This paper introduces Formula Prompting (FP) and Formula-One Prompting (F-1), two single-call methods that elicit governing equations before solving applied-math problems. Chain-of-Thought (CoT) and Program-of-Thought (PoT) prompting improve mathematical reasoning by eliciting reasoning traces or code-like structures learned during pretraining. This suggests a diagnostic question: which useful pretraining patterns remain under-elicited? Using infini-gram-mini, we scan 81.7 trillion pretraining tokens and find that, in curated corpora such as DataComp-LM, equation-centered language appears 121x more often than code and 3.79x more often than step-by-step narration, yet standard prompting methods do not explicitly elicit equation formulation. FP asks the model to formalize a problem's governing equations before solving; F-1 extends FP with a composable Phase 2 that selects Direct, CoT, or PoT-style solving in the same call. Across five reasoning models and four applied-math benchmarks (finance, physics, cryptography, competition math), F-1 outperforms CoT by 5.76 pp and PoT by 8.42 pp on average, with the largest gain of 13.30 pp on FinanceMath, while topping the accuracy-token efficiency frontier at only 68 prompt tokens of overhead. Variant ablations identify the equation-formalization prefix, not the strategy menu, as the primary driver: adding CoT or PoT on top of the prefix yields no further gain, and 73.3% of remaining failures occur downstream of a correct Phase-1 equation.

Natapong Nitarach, Pittawat Taveekitworachai, Kunat Pipatanakul• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningOlympiadBench
Accuracy65.81
57
Mathematical ReasoningIMO-Bench
Accuracy57.02
57
Mathematical ReasoningFinanceMath
Accuracy64
20
Mathematical ReasoningOverall Macro-average
Accuracy (%)70.97
20
Mathematical ReasoningAICrypto
Accuracy0.985
20
Cryptographic ProofAICrypto (test)
Efficiency Ratio1.42
4
Cryptography ReasoningAICrypto n=18
Tokens per Correct8.43e+3
4
Financial CalculationFinanceMath (test)
Efficiency Ratio3.04
4
Financial Mathematical ReasoningFinanceMath n=200
Tokens per Correct4.37e+3
4
Olympiad Mathematical ReasoningOlympiadBench
Tokens per Correct2.55e+4
4
Showing 10 of 13 rows

Other info

Follow for update