Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Chain of Draft: Thinking Faster by Writing Less

About

Large Language Models (LLMs) have demonstrated remarkable performance in solving complex reasoning tasks through mechanisms like Chain-of-Thought (CoT) prompting, which emphasizes verbose, step-by-step reasoning. However, humans typically employ a more efficient strategy: drafting concise intermediate thoughts that capture only essential information. In this work, we propose Chain of Draft (CoD), a novel paradigm inspired by human cognitive processes, where LLMs generate minimalistic yet informative intermediate reasoning outputs while solving tasks. By reducing verbosity and focusing on critical insights, CoD matches or surpasses CoT in accuracy while using as little as only 7.6% of the tokens, significantly reducing cost and latency across various reasoning tasks. Our code and data are available at https://github.com/sileix/chain-of-draft.

Silei Xu, Wenhao Xie, Lingxiao Zhao, Pengcheng He• 2025

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningWinoGrande--
1085
Mathematical ReasoningMATH
Accuracy95.63
882
Mathematical ReasoningMATH500 (test)
Accuracy94.8
514
Mathematical ReasoningGSM8K
Accuracy83.2
499
Mathematical ReasoningMathVista
Score89.1
385
Multimodal UnderstandingMMStar--
324
Multi-discipline Multimodal UnderstandingMMMU
Accuracy58.9
317
Optical Character RecognitionOCRBench
Score89.1
232
Visual Mathematical ReasoningMathVision
Accuracy22.7
186
Mathematical ReasoningAIME 24
Accuracy48.54
154
Showing 10 of 59 rows

Other info

Follow for update